Gene Information |
Locus tag | Mmar10_2684 |
Symbol | |
ID | 4286136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 2932800 |
End bp | 2935655 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638142183 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_757908 |
Protein GI | 114571228 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
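The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2932800
end_bp = 2935655
gene_length_bp = 2856
protein_length_aa = 951


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming a terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)
```

Under these assumptions the span 2932800..2935655 covers 2856 bp, which corresponds to 952 codons, or 951 amino acids plus a stop codon.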
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.801873 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCTGG ACCAGGACCT TGATTTCGGC ACACCGGCGC CGCGTGGCGC CATCCGGTCT GTCACGCTGG AGATAGACGG CTTCAATGTC ACAGTGCCGG AAGGCACCAG CGTCATGCGG GCCGCCTCGT CCCTGGGCAC ACAGATCCCG AAACTTTGCG CCACTGACAG CCTGAAGGCT TTCGGCTCCT GCCGGCTCTG CCTGGTCGAG ATCGACGGAA AGCGCGGCAC GCCGGCCTCC TGCACGACCC CGGTCGAGCC GGGCATGAAA GTCACGACCC AGTCCGACCG CCTGGCGCGC CTGCGCCGCG GGGTAATGGA GCTTTATATC TCCGACCATC CGCTCGATTG CCTGACCTGT TCGGACAATG GCGATTGCGA ACTGCAGGAC ATGGCGGGCG CGGTCGGCCT GCGCGATGTC CGCTATGGGC TCGCGGGCGA GAACCATGGC GGTCAGGACA AGGATATCTC CAACCCCTAC TTCGACTTCG AGCCGTCAAA ATGCATTGTC TGCTCACGTT GCGTTCGGGC CTGCGACGAG GTCCAGGGCA CGCTGGCGCT GACGGTCGAG GGGCGCGGTT TCGACTCGCA CATCTCGGTC GGTGGCCCGG ATTTCTTCAA CTCGGAATGT GTTTCCTGCG GCGCCTGTGT GCAGGCTTGC CCGACCGCGA CCCTGCAGGA GAAGACTGTC GTCGAGCACG GCGTGCCGGA CCGGACGGTG AAGACGACCT GCGCCTATTG CGGGGTGGGC TGTTCCTTCA AGGCCGAGCT GAAGGGCAAT CAGGTCATCC GCATGGTGCC GGACAAGGAT GGCAAGGCCA ATCACGGCCA TTCCTGCGTC AAGGGACGCT TTGCGTGGGG CTATGCCTCG CATGCCGACC GCATCACGGC GCCGATGATC CGCGACAGCA TTGACGAGCC CTGGCGCGAA GTGTCGTGGG ACGAGGCCAT CGGTTTCGCA GCGACGCGTC TCAAGGCCAC GCAAGCTGAA CACGGGCGCA AGTCGATCGG CGCCATTTCC TCCTCACGCT GCACGATCGA GGAAGTCTGG CTGGTCCAGC GCATGGTGCG CGCCGCCTTC TCCAACAACA ATATCGATAC CTGTGCCCGG GTCTGCCATT CGCCGACCGG CTATGGCCTC AAACAGACCT ATGGCACCTC CGCCGGCACA CAGGATTTCG ACAGTGTCGA GGACGCGGAC GTCATCCTGC TGATCGGCGC CAATCCGACC GACGCGCACC CGGTCTTTGC CTCGCAGATG AAGCGTCGCC TGCGCGAAGG GGCCAAGCTG ATCGTCGCCG ATCCGCGCGC CATTGACCTC GTTCGAACGC CACATGTCGA AGCCGCGCAC CATCTGGCTC TCCAGCCCGG CACCAATGTC GCCCTGGTCA ATGCGCTGGC CCATGTCGCC GCCACCGAAG GCCTGCTCGA CGAGGATTTC GTCGCCGAGC GCTGTGAGAC CGACAGTTTT GCCGACTGGA TCGACTTCAT CCGAGACGAC GCGAACTCGC CGGAAGCCGC GGCCGCGATC ACCGGTGTTC CGGCAGACGA CATCCGCGCC GCGGCCCGTC TCTATGCCGG CGGCGGCAAG GCGGCGATCT ATTACGGGCT GGGCGTCACC GAGCACAGCC AGGGTTCGAC CATGGTCATG GGCATGGCCA ATCTCGCCAT GGCAACCGGC AATATCGGGC GACGGGGTGT CGGGGTGAAC CCGCTGCGCG GGCAGAACAA TGTCCAGGGC TCGTGCGATA TGGGCTCCTT CCCGCATGAA TTCCCCGGTT ACCGGCATGT CAGCGATGAT 
GCGACCCGGG CGATCTTCGA AACGGCGTGG AAGCGCCCAC AGGACGCCGA GCCGGGCTTG CGCATTCCCA ACATGTTCGA CGAGGCCTGC GGCGGGACGT TCAAGGGGCT CTATGTGCAG GGCGAGGACA TCGCCCAGTC CGATCCCAAC ACCCAGCATG TCGAAGCGGC GCTGCGGGCG CTGGACATCC TGATCGTGCA GGACCTCTTC CTCAACGAGA CTGCGCGCTT TGCCCATGTC TTCCTGCCCG GCACCTCCTT TCTGGAAAAG GATGGCTGTT TCATCAATGC CGAGCGCCGC ATCAACCGGG TCCGGCCGGC CATTCCCGTC AAGACCGGCA TGGCCGAATG GGCCGTCACC CAGTCCCTGA CCCTGGCGAT GGGATATGAC GAGACGGTCT TTTCCAGCAG TGCCGAGATC ATGGACGAGA TCGCCACTCT GACGCCAACC TTTGCCGACG TCTCCCATGC GCGGCTGGAT CGTGATGGCA GTGTGCAATG GCCCTGTAAC GAGGCCGCAC CACAGGGTAC GCCAATCATG CATGTCGACG GCTTTGTGCG TGGCAAGGGA CATTTCGTGA TCACTAAATT CGTGCCGACG ACCGAACGCG CCAACCGCAA ATTCCCGCTG ATCCTGACCA CCGGCCGGAT CCTCAGCCAA TACAATGTCG GCGCCCAGTC ACGGCGGACC GAGAATGCCC GCTGGCATGA CGAGGATGTG CTGGAGCTGC ATCCCAGTGA CGCCGAAGCA CGCGGAATTC TCGATGGGCA TTGGGTCTCG GTGCGCAGTC GCAAGGGTGA GACCACCTTG CGCGCCCGGC ACTCCGAGCG GATGGCGCCG GGCGTCGTCT ATACCACTTT TCACCATCCC GAGACCGGCG CCAATGTGGT GACAACCGAA AACTCGGACT GGGCGACCAG CTGCCCGGAG TACAAGGTCA CCGCGGTCGA GGTCGCACCG GCCAATCACC GTTCGCCCTG GCAGGAAGAG CGCTCCGGTT CCGCCCGCGA AGCGCGCCGG ATCAAGCGGG ACGACCATGT CGAGCCGGCG CAGTGA
Protein sequence | MPLDQDLDFG TPAPRGAIRS VTLEIDGFNV TVPEGTSVMR AASSLGTQIP KLCATDSLKA FGSCRLCLVE IDGKRGTPAS CTTPVEPGMK VTTQSDRLAR LRRGVMELYI SDHPLDCLTC SDNGDCELQD MAGAVGLRDV RYGLAGENHG GQDKDISNPY FDFEPSKCIV CSRCVRACDE VQGTLALTVE GRGFDSHISV GGPDFFNSEC VSCGACVQAC PTATLQEKTV VEHGVPDRTV KTTCAYCGVG CSFKAELKGN QVIRMVPDKD GKANHGHSCV KGRFAWGYAS HADRITAPMI RDSIDEPWRE VSWDEAIGFA ATRLKATQAE HGRKSIGAIS SSRCTIEEVW LVQRMVRAAF SNNNIDTCAR VCHSPTGYGL KQTYGTSAGT QDFDSVEDAD VILLIGANPT DAHPVFASQM KRRLREGAKL IVADPRAIDL VRTPHVEAAH HLALQPGTNV ALVNALAHVA ATEGLLDEDF VAERCETDSF ADWIDFIRDD ANSPEAAAAI TGVPADDIRA AARLYAGGGK AAIYYGLGVT EHSQGSTMVM GMANLAMATG NIGRRGVGVN PLRGQNNVQG SCDMGSFPHE FPGYRHVSDD ATRAIFETAW KRPQDAEPGL RIPNMFDEAC GGTFKGLYVQ GEDIAQSDPN TQHVEAALRA LDILIVQDLF LNETARFAHV FLPGTSFLEK DGCFINAERR INRVRPAIPV KTGMAEWAVT QSLTLAMGYD ETVFSSSAEI MDEIATLTPT FADVSHARLD RDGSVQWPCN EAAPQGTPIM HVDGFVRGKG HFVITKFVPT TERANRKFPL ILTTGRILSQ YNVGAQSRRT ENARWHDEDV LELHPSDAEA RGILDGHWVS VRSRKGETTL RARHSERMAP GVVYTTFHHP ETGANVVTTE NSDWATSCPE YKVTAVEVAP ANHRSPWQEE RSGSAREARR IKRDDHVEPA Q