Gene GM21_2835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2835 
Symbol 
ID8138178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3299663 
End bp3302056 
Gene Length2394 bp 
Protein Length797 aa 
Translation table11 
GC content66% 
IMG OID644870437 
Product4-hydroxybenzoyl-CoA reductase, alpha subunit 
Protein accessionYP_003022626 
Protein GI253701437 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID[TIGR03194] 4-hydroxybenzoyl-CoA reductase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones93 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA ACCACAGCGT AATAGGCCGC AGCGTCCCCC GCATCGACGG GCCGGAGAAG 
GTCACCGGGG CGGCCAAGTA CACCGGGGAC CTGAAGTTCC CCAACATGCT TTACGGCAAG
ATCCTGACGA GCCCCCACGC CCATGCCCGG ATCCTCTCCA TCGACACCTC CGAGGCGGAG
CGTCTTCCCG GGGTGAAGGC GGTGATCACC CACAAGGACG TGCCGACCTT GAAGTACGGC
CTGAGCCCGG CCCGCTGGGA CGAGAGCATC TTCTGCAGCG ACAAGGTCCG TTTCGTGGGG
GACAAGGTGG CGGCCGTGGC CTGCCTGGAC GAGGACACCT GCTACAAGGC GCTGAAGCTG
ATCAAGGTGG AGTACGAGGT GCTCCCCGCC GCTCTCGACT TCCTGCATGC CATGGACGAG
GGGCAGCCGC AGGTGCACGA AGAGTACGCG AGAAACATCA ACACCGAGAT CCACCAGGAG
TTCGGGGACG TGGAGAAGGC GCTCGCCGAG GCGCACCACG TGCGCACCGA CACCTTCGTG
GGGCAGAGGA CCTACCAGTC ACCCATCGAG CCGCACTCCG CCATCTCCAT GTGGGACGGG
GAGAAGCTCA CCATCTACTC CAGCACCCAG TCGCCGCACT ACTTCCAGCA CTACATCGCC
CGCGAGTTCG ACATGCCCAT GGGTGACGTG CGCATCATCA AGCCCTACCT CGGGGGCGGT
TTCGGCGGCA AGCTGGAGCC GACGGGGCTC GAGTTCGCAG GCGCCGTGCT GGCGAAGCTG
ACCGGCCGGC CGGTTAGGAC CTTTTACGAC CGCGCCGAGA TGTTCGCCCA CAACCGCGGG
CGGCACGCCC AGTACATGGA GATCACCACC GGCGTGGACA AAAACGGCAA GATCCTCGCC
GCCAAGGCCA ACTTCCTCAT GGACGGCGGC GCCTACACGA GCCTCGGCAT CGCGAGCGCC
TACTACGCCG GCGCTCTGCT CCCGCTCACC TACGAGTTCG ACAACTACCA GTTCGACATG
TTCCGGGTCT ACACCAACCT CCCCGCCTGC GGCGCCCAGC GCGGCCACGG CGCCCCCCAG
CCCAAGTACG CCTTCGAGAG CCACCTGGAC AACGTGGCGG CGGACCTGGG AATCGACCCG
ATGGACATCA GGATCATCAA CGCCCGGCGC CCGAACACGG TCACCCCCAA CGACTTCCGG
GTCAACTCCT GCAAGATCAA GGAGTGCCTG GAGCGGGTGC GGGTGATGTC GGACTGGGAC
GAGAAGAAGA AAAACCTCCC CCTGGGGAGG GGGATAGGGG TCGCCACCGG GAGCTTCGTC
ACCGGCGCGG GGTATCCCAT CTACCGCACC GACCTGCCGC ACGCCGCCGC CTTCATCAAG
GTCTGCGAGG ACGGCACCGC CGCCACCCTC TACACCGGAT CGGTGGACAT AGGGCAGGGG
TCGGACACCG TGCTCTGCCA GATGGCGGCC GAGGCGATGG GGTACCGCTA CGAGCAGATG
AAGATCGTCG CCGCCGACAC CGAGATCACC CCGCTCGACT TCGGCGCCTA CGCGAGCCGC
CAGACCTACA TGTCCGGCGC CGCCGTGAAG CAGGCCGGCG AAGAGGTGAA GCGGCAGATC
CTGGAGATGG CCTCCAGCAT GCTGGGGCTT CCGGCGGACG ATCTGGAGTG CGACGACGGC
AAGGTCTTCT CCAAGTCACG TTCCGGGAAG AGCCTCAGCT TCGAGGAGGT GGCCAGGAAG
CACTTCGTGC TCCGTGGACC GCTTCTCGGG CGCGGCTCCT ACACCCCGCC CAAACTCGGC
GGGAGCTTCA AGGGCGCTGC CGTCGGCACT TCCCCCGCAT ACAGCTTCGG GGCCCAGGTG
GGAGAGGTGG CCATCGACGA GGAGACCGGC GAGATCACCG TGGTCGGTAT CTGGGACGTG
CACGACTGCG GCAAGGTGAT CAACCCGCGC CTTCTGCACG GCCAGGTGCA CGGCGCCCTC
TACATGGGTA TGGGGGAGGC GGTCTGGGAG GAGGTCCTCT TCGACGACAA GGGGCGCATC
AAGAACGCGG AGCTCGCGAA CTACCGCCTC CTGACCGCCG TGGACATGCC CCCCATCACC
TCAGAGGTGG TGGACAGCTA CGAGCCGAGC GGCCCCTGGG GGGTGAAGGA GGTGGGCGAA
GGGGCGACCA ACCCGACCTT GGGTATGTTC TCCAACGCCA TCTTCGACGC CATGGGGGTG
CGGGTCAATT CGCTGCCGCT TAGCTACGAG AAGGTGTGGC GCGCCCTGAA GGAAAAGCGC
GAGCGGGAGG AGATTGCCAA GCGGGAAGAC GCCCAAAGGG GACTGGCTCC GCAGGTGCCT
GTCCCCCTTG AGAGTGAGTC CGCATCGGAG CCCATTCACG CGAACCCGAG CTGA
 
Protein sequence
MSDNHSVIGR SVPRIDGPEK VTGAAKYTGD LKFPNMLYGK ILTSPHAHAR ILSIDTSEAE 
RLPGVKAVIT HKDVPTLKYG LSPARWDESI FCSDKVRFVG DKVAAVACLD EDTCYKALKL
IKVEYEVLPA ALDFLHAMDE GQPQVHEEYA RNINTEIHQE FGDVEKALAE AHHVRTDTFV
GQRTYQSPIE PHSAISMWDG EKLTIYSSTQ SPHYFQHYIA REFDMPMGDV RIIKPYLGGG
FGGKLEPTGL EFAGAVLAKL TGRPVRTFYD RAEMFAHNRG RHAQYMEITT GVDKNGKILA
AKANFLMDGG AYTSLGIASA YYAGALLPLT YEFDNYQFDM FRVYTNLPAC GAQRGHGAPQ
PKYAFESHLD NVAADLGIDP MDIRIINARR PNTVTPNDFR VNSCKIKECL ERVRVMSDWD
EKKKNLPLGR GIGVATGSFV TGAGYPIYRT DLPHAAAFIK VCEDGTAATL YTGSVDIGQG
SDTVLCQMAA EAMGYRYEQM KIVAADTEIT PLDFGAYASR QTYMSGAAVK QAGEEVKRQI
LEMASSMLGL PADDLECDDG KVFSKSRSGK SLSFEEVARK HFVLRGPLLG RGSYTPPKLG
GSFKGAAVGT SPAYSFGAQV GEVAIDEETG EITVVGIWDV HDCGKVINPR LLHGQVHGAL
YMGMGEAVWE EVLFDDKGRI KNAELANYRL LTAVDMPPIT SEVVDSYEPS GPWGVKEVGE
GATNPTLGMF SNAIFDAMGV RVNSLPLSYE KVWRALKEKR EREEIAKRED AQRGLAPQVP
VPLESESASE PIHANPS