Gene GM21_3382 details


Gene Information

Locus tag: GM21_3382
Symbol:
ID: 8138749
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3913697
End bp: 3915118
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 63%
IMG OID: 644871000
Product: hopanoid biosynthesis associated radical SAM protein HpnJ
Protein accession: YP_003023165
Protein GI: 253701976
COG category: [C] Energy production and conversion
COG ID: [COG1032] Fe-S oxidoreductase
TIGRFAM ID: [TIGR03471] hopanoid biosynthesis associated radical SAM protein HpnJ
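
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record states neither, so treat both as assumptions.

```python
# Values copied from the Gene Information fields above.
start_bp = 3913697
end_bp = 3915118
gene_length_bp = 1422
protein_length_aa = 473

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, so a CDS of
# N codons encodes N - 1 residues.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_length == protein_length_aa)
```

Under those assumptions all three checks agree with the record: the span is 1422 bp, which is 474 codons, or 473 residues plus one stop codon.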


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 107
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCCGT TGTTTCTGAA CCCGCCGACC TTCGAGGACT TCGACGGGGG GGCCGGTGCG 
CGCTACCAAG CCTCCCGCGA GGTTACCTCC TTCTGGTTTC CCGGCTGGCT TACCTACCCC
GCAGGGATGA TCCCCGGAGC CCGGGTCGTG GATGCCCCGG TGCAGCGGCT TGACCTCGAT
GCCTGCCTCA ACATCGCGAA AGAATACGAC ATGGTGGTGA TGTACACCTC CACCCCGACC
CTTGCCATCG ACGTCGAAAC CGCCCGTCGC CTGAAAGCCC AGAACCCCGC CACCGTCACC
GTCCTGACCG GCCCCCACGT AAGCGTCCTC CCCGAGGAAT CGCTTCGCTT CGCCGCCGGC
GCCGTCGACA TAGTCTGCCG CGGCGAGTTC GACTACTCCA CCAAGGAGCT TTGCGAAGGG
AAGCCGCGTG CGGAGGTGGA CGGCATCAGC TTTCTGCAGG ACGGCAAGGT GGTCCATACC
AAGGACCGCC CCCCCATCGC CGACCTCGAC AGCCTCCCCT TCGCAAGCCA GGTCTACCAC
CGCGACCTTC CCATCGAGGA ATACGTCATC CCGCATTTTC GGCATCCGTA CGTCTCCATC
TACGCCTCCC GCGGCTGCCC CTCCCGCTGC ATCTACTGCC TGTGGCCCCA GACCTTCTCC
GGGCGGACCC TGAGGAAGAG GAGCCCGCAG AACGTCTACG AGGAGGTGCG CTGGATCAAG
GAGAACCTCC CCCAGGTGAA GGAGATCTCC TTCGACGACG ACACCTTCAC CGCCGACCGC
GAGCACGCGA AAGCGATCGC CCGGGCCATC AAGCCGCTCA ACGTCTCCTG GGTGATCAAC
GCGCGGGCCA ACTGCGACTA CGAGACGCTC AAGGAGTTGC GCGACGCCGG GATGCACCAC
GTGGTGGTCG GCTACGAGAC TGGGAACGAA GAGATCCTGA AGAACATCAA GAAGGGGGTC
ACCAAGGCGC AGGCGATAGA GTTCACCCGC AACTGCCACA AGTTAGGCCT CACCATACAT
GGCGCCTTCG TCCTGGGGCT CCCCGGGGAG ACCAGGGAGA CCATCAAGGA AACCATCGCC
TACGCCATAG ATCTCAACCT CACCTCGATC CAGGTCTCGC TCGCCTCCCC CTATCCCGGC
ACCGAGTTCT ACGACATGGC GGTGCGGGAG GGTTGGATCG CCTCGGACAG CTTCCTGGAC
GCCAGCGGGC ACCAGAAATG CGTCATCAAC TACCCCGACC TCTCGAACCG GGAGATCTTC
GACGCGGTAG AGCTCTTCTA CAACAAGTTC TATTTCCGCC CCCGCTACAT CGCGCGCAGC
CTCTACAGCA TGCTGGTCGA CTCCGCCCAG CGCAAGAAGC TCCTCAAGGA GGGGGCGCAG
TACCTGAGCT ACATGAGAAA GCGCAAGCAG GCCTGCGCCT AA
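
The gene sequence is printed in blocks of 10 bases separated by spaces, so the whitespace has to be removed before any computation. As a minimal sketch, the hypothetical helper `gc_fraction` below (not part of the record or of any library) computes the fraction of G and C bases in such a blocked string. It is shown on the first two blocks only; applied to the full sequence it could be compared with the GC content reported in the record.

```python
def gc_fraction(blocked_sequence: str) -> float:
    """Return the fraction of G/C bases, ignoring spaces and line breaks."""
    # str.split() with no arguments splits on any run of whitespace.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two blocks of the gene sequence, as printed in the record.
first_blocks = "ATGAAGCCGT TGTTTCTGAA"
print(round(gc_fraction(first_blocks), 2))  # 8 of 20 bases are G or C -> 0.4
```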
 
Protein sequence
MKPLFLNPPT FEDFDGGAGA RYQASREVTS FWFPGWLTYP AGMIPGARVV DAPVQRLDLD 
ACLNIAKEYD MVVMYTSTPT LAIDVETARR LKAQNPATVT VLTGPHVSVL PEESLRFAAG
AVDIVCRGEF DYSTKELCEG KPRAEVDGIS FLQDGKVVHT KDRPPIADLD SLPFASQVYH
RDLPIEEYVI PHFRHPYVSI YASRGCPSRC IYCLWPQTFS GRTLRKRSPQ NVYEEVRWIK
ENLPQVKEIS FDDDTFTADR EHAKAIARAI KPLNVSWVIN ARANCDYETL KELRDAGMHH
VVVGYETGNE EILKNIKKGV TKAQAIEFTR NCHKLGLTIH GAFVLGLPGE TRETIKETIA
YAIDLNLTSI QVSLASPYPG TEFYDMAVRE GWIASDSFLD ASGHQKCVIN YPDLSNREIF
DAVELFYNKF YFRPRYIARS LYSMLVDSAQ RKKLLKEGAQ YLSYMRKRKQ ACA
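
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a plain Python illustration, not the database's own procedure. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_sequence: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(blocked_sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in the record.
print(translate("ATGAAGCCGT TGTTTCTGAA CCCGCCGACC"))  # MKPLFLNPPT
print(translate("GCCTGCGCCT AA"))                     # ACA
```

The two outputs match the first block and the final three residues of the protein sequence listed above.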