Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aave_3925 |
Symbol | |
ID | 4666615 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidovorax citrulli AAC00-1 |
Kingdom | Bacteria |
Replicon accession | NC_008752 |
Strand | - |
Start bp | 4348824 |
End bp | 4351751 |
Gene Length | 2928 bp |
Protein Length | 975 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 639825113 |
Product | cytochrome c, class I |
Protein accession | YP_972242 |
Protein GI | 120612564 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [COG2010] Cytochrome c, mono- and diheme variants |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.073446 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCTGC CGGACCGTTC CATCACGCCT CTTGCGACCG AGGTTCCCGG CGCGCCCGCG GATCTGCTGC ACGGGCTGGC CGTGCGGCCC GGCCGCATGG GCTGGGACGG GCTGCGCTAC CAGGGCGACG CACTGGCCGA TGCCCGGCTG GACGATGCGC GCGCCATGGC CGGCGTCGTG GCCACGGTGC GCATCGGGAA TTTCCTGGGC GTGGTCGCCA TGGCACCGGT GCATGCGCAG CAGGCAGCGG TGGCCCTGGC GCCGGTCTGG CGCGGGATGG ATGCCGGTGC GAGCACACCG CTGCCCGAAG CCCCGGCGGG CGATGCGTTC GTCTGGCGAC CGCCCCGCAC CGATGCGCCT GCCGGCGCGC GTGCCGTCGC CTGGTGCATG GAGGGCCATG CCAGCCTGTG GCTGCCCGCC TGCGCACCGG ATATACAGGC CCTGGTCGCT GGTGAAATCG CGGCGCTGCT GCAGCTGCCC GTGCAGGCTG TGCGCCTGTT CGTGCTGGAC GGCGCCGCGC CGCACGCGCT GGCGTTGATG GACGCGGCGG CCGACGCGGC GCTGCTGTCC CGGGCCGTGG CCCGGCCCGT CAGCGTGGCG TGCGCGGCGG GCGTCGAGCC GTCCGCGCTG GCGCTGCAGC CAGCAGCTCT GCCCTCCATG GACACCGTCG CCGGCACCGC GTTGCGTCCG GCCGAAGGTG CTGCGCAAGC GCCAGCCCCT TTTGTTCTTT CCTCGCCTCT GCCCTGGGCC GTGCGGCCCA GTTGCGCACG CTTGCTGAGC CATTCCGATG CGGCCCGCGC AGCGGCCCTG CCCAGCGTGC AGGACGCCGG CACCGTCCAG GGCGGACGCC CTGCGCGCGG CCTCTCTCAG GCCGGGGTGG ACGATCTCAA TGCCGCCCAG GTGTTCGCCA GCGAAAGCCT GTGGCACGAA CAGGCCCTGG AGCAGGGGCA GGATCCGCTG GACTGGCGCC TGCGGCACCT GCCCGAAGGC CCGGCGCGCG ACCTGGCCAC GCGCGTGGCG CGGAGCGCGC CGTGGCCGGG CGCCGGACGG GAATCCGACG GCCGCCTGCA CGGCCGGGGC TTCGCGACCG CCTGCATGCA GACCCTCGAC CCCGAAGGGC GGGACAGCGT GGTCTGGAGC GCCTGGGTCG CCGAGGTGGC CGTGCATCCG CAGACCGGCG AGATCGAAGT CACCCGCGTG GTCGCGGGCC ACGACAGCCA GCACCTGCGG GAAGCGCAGG GCGCCGGCGT GCATACGCAG ATCGTGGAGC AGGACGCGCA CTGGCTGGCC GGCGCGCGCC GGCTGCTGGG CGCGCCGCCC GCCTTCGATG GCTGGCGCAG CCCTGCAGGC GCAGCCCCCA CCGATGCGCC GACGTTACGC GGTGCGCTCG CGGCCGACGC TGCGGGCGAC GGCACGGTCG TCCGCCATGG CCAGCTGGAC CTGGACGGCG TGGCCACCCT GCCCGCCGCC GCCGCGATCG CCAATGCCAT CCGCCATGCC ACGGGTGTGC GCCTGCGCGA AGTGCCCTTC CAGACCGAGC CGCTGCGCCT CGCGCTGGCC GGCGAGGCAG GTGCGGGCGC ACAATCCCCG CGCAGCACGC TGCGCCGCGG CCTGGGCTGG ATTGCGGCCG GTGCCGCGGG CATCGCGGGC CTGGCCGCCA TGGCGTGGCC GATGAAGCCG GCGTTGCCGC TCACCGACGG GCCCGATGCA TCGCTCTACT CGCAGCAGGC CATCGAGCGC GGCCGCCTGG TCGCGGCGGC CGGCGACTGC GTCGTCTGCC ACACCGCGCC GGGTGGCGCC CCGAACGCGG GCGGCCTGGG CCTGGAGACG CCGTTCGGCA CCATCTACTC GACCAACATC ACGCCCGACG AAGAAACCGG CATCGGCCGC TGGTCCTACG CGGCCTTCGA GCGCGCCATG CGCCAGGGCA TCCACCAGGA CGGCCGACAG CTCTATCCGG CCTTCCCGTA CACGGCCTTC GCCAAACTCA GCGACGGCGA CCTGCAGGCG CTCTACGGCT ACCTGATGTC CCGGCCTGCC GTGCAGGCCC GGCCGCCCGA AACGCGGCTG CCTTTTCCCT ACAGCCTGCG CCCGGCCATG GCCGGCTGGA ACCTGCTGTT CCACGATGCC ACGCCCTTCC GCGCCGATCC GGCCCGCAGC GCCGAATGGA ACCGGGGCGC CTACCTGGTG GAAGGCGCGG GCCACTGCGC GGCCTGCCAC TCGCCGCGCA ACGCGCTGGG CGCCGAGAAG GGCGGCATCC ATTACCTCGG CGGCGGCGAG GCCGAGGGCT GGAGCGCACC CGCCCTCGAC CGGTTGGCTG ACGGCCCCCG GCCCTGGAGC CGCGAGTCGC TCTACCAGTA CCTGCGCGCG GGGTTCTCCG CGCGCCACGG CGTGGCCGCC GGCCCCATGG CTCCCGTGAT CCACGGTCTG GCCGAACTGC CCGATACCGA TGTGCGCGCC ATCGCCACCT ACCTGCTGGA GCTGCCCGGC CGGGCGCCGC AGGCCGCCCC GGCCCCGGAG CCCGCGCCAG TCGCCGTACC GGCGGCGGCC GCAGCGGTGC CGGTGCCCGC GCGCACCCTG GACCGCCACG CCAACGGCGA GCGCATCTAC CAGAACGCCT GCGCCGTCTG CCACGAGGCC GGCAGCGGCC CGACCCTCTT CGGCGCCAAG CCGCAGCTGG CGCTCAACAC CAACCTGCAT GCCGCCACGC CGGACAACCT GGTCCAGGTC ATCCTCCACG GCATACAGCA ACCCGCCAAC GACGCACTGG GCTACATGCC GGGCTTCGGC GACAGCCTGG ACGACCGGCA GATCACCGAC CTGCTCGGCT ACCTGCGCGC CCGGTTCGCA CCGGAGGAAA AAGCGTGGCC GCACGACACG GACACGGTCC GCCGGCTGCG TGCCACGGCG CACGCGGGCG CGCCATGA
|
Protein sequence | MNLPDRSITP LATEVPGAPA DLLHGLAVRP GRMGWDGLRY QGDALADARL DDARAMAGVV ATVRIGNFLG VVAMAPVHAQ QAAVALAPVW RGMDAGASTP LPEAPAGDAF VWRPPRTDAP AGARAVAWCM EGHASLWLPA CAPDIQALVA GEIAALLQLP VQAVRLFVLD GAAPHALALM DAAADAALLS RAVARPVSVA CAAGVEPSAL ALQPAALPSM DTVAGTALRP AEGAAQAPAP FVLSSPLPWA VRPSCARLLS HSDAARAAAL PSVQDAGTVQ GGRPARGLSQ AGVDDLNAAQ VFASESLWHE QALEQGQDPL DWRLRHLPEG PARDLATRVA RSAPWPGAGR ESDGRLHGRG FATACMQTLD PEGRDSVVWS AWVAEVAVHP QTGEIEVTRV VAGHDSQHLR EAQGAGVHTQ IVEQDAHWLA GARRLLGAPP AFDGWRSPAG AAPTDAPTLR GALAADAAGD GTVVRHGQLD LDGVATLPAA AAIANAIRHA TGVRLREVPF QTEPLRLALA GEAGAGAQSP RSTLRRGLGW IAAGAAGIAG LAAMAWPMKP ALPLTDGPDA SLYSQQAIER GRLVAAAGDC VVCHTAPGGA PNAGGLGLET PFGTIYSTNI TPDEETGIGR WSYAAFERAM RQGIHQDGRQ LYPAFPYTAF AKLSDGDLQA LYGYLMSRPA VQARPPETRL PFPYSLRPAM AGWNLLFHDA TPFRADPARS AEWNRGAYLV EGAGHCAACH SPRNALGAEK GGIHYLGGGE AEGWSAPALD RLADGPRPWS RESLYQYLRA GFSARHGVAA GPMAPVIHGL AELPDTDVRA IATYLLELPG RAPQAAPAPE PAPVAVPAAA AAVPVPARTL DRHANGERIY QNACAVCHEA GSGPTLFGAK PQLALNTNLH AATPDNLVQV ILHGIQQPAN DALGYMPGFG DSLDDRQITD LLGYLRARFA PEEKAWPHDT DTVRRLRATA HAGAP
|
| |