Gene GM21_3541 details


Gene Information

Locus tag: GM21_3541
Symbol:
ID: 8138913
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4090237
End bp: 4094853
Gene Length: 4617 bp
Protein Length: 1538 aa
Translation table: 11
GC content: 67%
IMG OID: 644871160
Product: hypothetical protein
Protein accession: YP_003023320
Protein GI: 253702131
COG category:
COG ID:
TIGRFAM ID:
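The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Amino-acid count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(4090237, 4094853)
print(length)                     # 4617
print(protein_length_aa(length))  # 1538
```

Both results match the Gene Length and Protein Length values listed in the record.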


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 145
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAAAAG TGATCGTTAA AAATATCAGG GGCATGGGAT GGGGTGCGAG AACCTGCCTG 
GTCGCGGTGT TCACCATTTT TGCTGTCGTG CTCTGCTTGC AGATGCGCGA CGCCAGGGAC
GCCCAGGCGG CGGTAGCGGT ACAGACCCAG TGGGCCATCC TCGGGACCGG TAGCACCTCG
ACCCTTCCGA CCATGACGCT GGCCAAAGGG GCGGGGAATA ACCGCCTGCT GGTGGTCAAG
GTGGTGGCCG AGTACAGCTC GTCCACCTCC ACCTTCACCC CGACGGTCAG GTACGGCGGA
CAGACCCTGA CCAAGATCGT GGCTACCGAC ACAACGAGCA ACCAGAAGGT CTGGTTCGGC
TATCTGAAGG AAACGGGGAT CGTTGCGGCG ACCGGCACCA CCCCGACCCT GACGGTGACT
TGGAGTTCCA CGCCGAACTC GGGCGTCGGC GTGTCCGCCG CCTTCTACTC CAACGTGGAC
CAGGGCTCGC CCATCACCGG CTCCCGCGCG GTGGGCTCGA CCAGCTCGGC CACGACACCT
ACCAGCGGCA CCATCAACGT CACCGCGGGG GGGTGGGCCA TCTACGGCTC CAACCTGAAC
AACGCCTATG CATCGACGTT GGCGACCGGC TACACCGAGC ACTTCGACAC GGCCAACGGC
AGCCTGTACC AGGATGCGGT CGGCTCCAAG CAGATCACCG TGAGCGGCAC GGAGAACCCG
CGCCCGACCT GGAACTCCAC GCGCTACGGC TTCGCGGTGG TCGGGATCAG ACCCGCCGTC
ACGACCCTCG GCAACGGCAC CGCCGGCAGC TCCGCCAACG TCGCTCCGGG AGCTGTCGCG
CAGAAACTGG ACGGCTTCTC GCTGGTCACC GGCTCGACCG GAACCACCGA CTCTGTGACC
GGCCTGACCG TCACCACCAC CAACAACGCG GCCATAGCCA GCCTGTCGAT ACAGAACGAG
GCGGGAACCA CCACCTACTT CACCGCGGTC AACAACCCCG GCTCCGATAC CTGGAATTTC
AGCGGCGGCA CCCCGATCCC GGTGACCAAC ACCGCGGCGA ACTACAAGAT CGTCGCCACC
TACAAGAGCC GCGCCGCGGG CGCGCCCTCG GGGCTCACCG CCACCACGGC GAGGGTCGCC
GCCATCACCA GCGGCAACGT ATTCGCCGGA AGCGACACGG CCGACACGAC GCTGACCCTC
TCCAACGTCC ATGCGGCTTC CACCTGGGGG GCCAACACGG TCGGCAACGC CAGCGCCACC
CTGAACTGGA GCTACGGCAC CGCCGGGCAG AGCGTGATCG TCGTCCGCTA CACCGCCAAC
ACCGACACCA CGAAACCTGC GGACGGCACC AGCTACGCCG CCGGCAACAC CCTCGGCACC
GGCACGGTAC GCTACGTGGG GAACCTCTCC ACCTTCACCG ACAGCGTGGG TCTCGTGAAC
GGGACCGCCT ACTATTACAA GATCTTCGAG TACGACAGCT ACGTTAACTA CTACAACGCG
AGCGACGTCT GGACCGGGCC TCTCACCCCG GTAAGCCCCG ACGCCGTCGC GCCGACCGTT
AATCCCGGCT TCGCCGCGAA AACCCCGGGC AACACCCTGG TCGTCCCGAT CACCGCCGGC
AGCTTCGCCG GCAGCGACAA CGTCGGCGGC AGCGGGATCT CCGGCTACCT GATCACCACC
AGCGCCACCC AGCCCGCCGC CAACGACCCG GGGTGGACCC CCACCGCGCC CGCGACTTTC
ACCGTCGCAG CGGCGGGTAC CTATACCCTC TACCCATGGG TCAAGGACGG CGGCAACAAC
GTATCGGCCC TTTATGTCAA CCCCGTCACC GTCGTCGTCG ACATGACCGC GCCGACGGTC
GCCACCTTCA CCGTCAACGC CCAGACCAAC AGCCGCAACA TCCCCATCGG CGGGATCACC
GGCACGGACG TCGGCACGAC GGTCGCGGCA TACCTGATCA CCTCGACCAG CACCCAGCCC
GCTGCAGGCG CGGCCGGCTG GAGCGGCACG GTACCCGCCA CCTTCACCGT GGCCGCGGAC
GGGACCTACA CCCTCTACCC CTGGGTGAAG GACGCTGCGG GGAACGTCTC GGCTGTTTCC
GGGGTGACCC GCTCCGTGCT GGTCGACACC GTCGCCCCGA CGGGGCTCTC GGCGGTTTCA
CCCGCCGACG CCTCCACCGA CCAGGCGTTC AACTCGACGC TGCAGTCGAG CGCCGCCACG
GACGCCGGGG TCGGGAACAT CAGCTACTAC TTCGAGATCA CCGACGGCGC CGCCTACACG
GCGAACAGCG GCTGGATCGC CTCCACCTCG TGGGCCCCCG CGGGGCTCGT CGCCGGCACC
ACCTACACCT GGACCGTCAG GGCGCGCGAC GGCGTATTGA ACCAGGTCTC CGCGACGCCG
AGGACCTTCA CCACTGCCGC TGCCTGCGTC CGCAACGATC CGACCGTCAC GCTCCTGACC
CCGACCGGCG GCGTCGCTTC CACCATCTCC GTCGACGGCG GGACCTCGGT CTACAACCTC
AAAGTGGTGA ACAACGACTA CGGCGGGTGC GGCTCCACCA CCTTCAACCT CGGCGTCTCC
GATACCGACG TCGGGGACGT CTTCGACCCG CCGACCCTCG CGGTCCCCTC GGTCACCCTC
GCCACTCGCG CGCAGACGAC CACGACCGTG ACCGTCAAGG CGACCGTGGA CCACCTGAGC
GGCGCCGGCA AGACCCGCGC CTTCACACTC GCCGACGCGA GCCACGCGCA GGTCACCACC
GGCGACGTGC AGACCACCCT CAACGTGGTG AGCTGCACCA AGAAAACCCC GCTTCTCATC
GTGGGGCCGG ATTCCGGCTA CTTGAACCGC GGCGGCAGCA TGAAGTACAC GGTCACCGTG
AAGAACACCG ACTCGGGCAC CGGCTGCGCC CCGGTCACCT ACAACCTGGC GATCCCCTCC
GAGACCAACA CGACGGATTT CAACGCCTCG TCGTTCAGCG CCCCGAGCAT CGTCCTCAAC
TCCGGCCAGA TCGGTTCGGT GACCCTCACC GTCAGCGCCA AGGCGACGGC GGCGAAGAAC
GTGGTCAATA AAACGACCGT GGCGGCCTCG GCGGCCTTGC ACACCTCGCC GGCCAACGTC
GTGGTCTCCT CGACGGTGAA CAACCCGATG CTGCACAACT CCGACAGCAC CAGCTCCACC
AGGTGGGCCG CCGACGGCGG CTGGGGCATC CCCAACGCCC GCTACGGCGA GTTCGACTGC
ACCACCTGCC ACGTGCAGGG AGGGGCGTCC ACCAGGAACG TGAACCGCAT CCGCGAGACG
GTGAGCGCCC CCGATGCCGG CAAGGGGGAA CTCCCAGGCG CGGGTCAGGC GATCAACTAC
AGAAGGACCG CCGGCACCAG CGCGACGCAG CCGGTACCGG GTTGGGATTC CGGGGCGACC
CCCAGAACCT CCTCGGAAAA GATATGCGAG GTCTGCCACA CCTATGACGC CACCGGCGCC
AACGGCACCA AGGCGCATCC CTTCGCCACC ACCGCGACGC TCGGCAACCA CTTCGGCACC
GACGGCCGCG ACTGCATCGA CTGCCACAAG CACGGCAAGG GTTTCAGCGT GAACGGCATG
GGCTGCACCG GCTGCCACGG CACCGAGACC GAGACCATCA CCCCGGATAA CCGCTACGCG
GTGGCTCCGC CCAGAAACGC CTCGGGGCTC TCCGGCACCG TTTCCGGCGC GGGCAACGTG
AGCAACGACC CGAAGGTCGG CGCGCACCAG ACCCACTTGA GGGGAACCAT CGGCTTCAGC
AACTACTCTA CGGTCGATTA CCGCTGCGAG GCCTGCCATG GCCCGATCCC GACCAACTTC
AGCCACATGA ACTCCAACGC GCTTCCGGCG TTCCAGGGGC TGGCCACGAA GAACGGCGCC
ATGAGCGCCA CTTACAGCGG CACCAACTGC AACAACACCT ACTGCCACAA TCCTGCGGGG
ACCGGCGGTA CGCTTCTGGC GGTCAACGCA GGCACCGCGG TGTTCCCCTC CTGGACCAGC
GCGAACTACG TGGCCGGCGG CGCGAAATCC GTCGCCAACT GCAACGTCTG CCATAAGGTC
CCGGGCGACA CCGGCTTCCA GCCTGCCGGT ACGCATTCTG GGATGAACAC CGACAGCACC
GACTGCGCCG GGTGCCACGG CCATAACGGC GACGCGACCG GCACCGTGGC CGGGAAAAGG
CACATGGACG GCATCAAGTG GGGCGTCGGC AACTGCGACA GCTGCCACGG CTACGGCCCG
GGGACCTGGG CGAGCATGCC GGAGCGCTCC GGCGTAGCCG AGGGGAAGGG TGCGCACGAG
AAACACATCG CCTACCTGAT GCAGCGTAAC AACGCCACCC TCACCGCCAC CAGCGACGCC
TTCGGCTCTG GTACCCCGTG GACAGCGGTA TGCGGGGTAT GCCATAATGG CGCCGCCCAC
GACATGGGCG AAGCCATCCC CGGCACCGGG CGCACCATCT CCATCACGCC TGCCCGCAAC
TTCGGCGGGA CCGCCGTCTA CAACGGCATA GTGGGCCAGT CTTCGCAGTT CAAGGCCAAG
ACCTGCTCCA ACGTCGATTG CCACTATAAA GAAACGCCAG TCTGGTCGTC GTACTAA
 
Protein sequence
MGKVIVKNIR GMGWGARTCL VAVFTIFAVV LCLQMRDARD AQAAVAVQTQ WAILGTGSTS 
TLPTMTLAKG AGNNRLLVVK VVAEYSSSTS TFTPTVRYGG QTLTKIVATD TTSNQKVWFG
YLKETGIVAA TGTTPTLTVT WSSTPNSGVG VSAAFYSNVD QGSPITGSRA VGSTSSATTP
TSGTINVTAG GWAIYGSNLN NAYASTLATG YTEHFDTANG SLYQDAVGSK QITVSGTENP
RPTWNSTRYG FAVVGIRPAV TTLGNGTAGS SANVAPGAVA QKLDGFSLVT GSTGTTDSVT
GLTVTTTNNA AIASLSIQNE AGTTTYFTAV NNPGSDTWNF SGGTPIPVTN TAANYKIVAT
YKSRAAGAPS GLTATTARVA AITSGNVFAG SDTADTTLTL SNVHAASTWG ANTVGNASAT
LNWSYGTAGQ SVIVVRYTAN TDTTKPADGT SYAAGNTLGT GTVRYVGNLS TFTDSVGLVN
GTAYYYKIFE YDSYVNYYNA SDVWTGPLTP VSPDAVAPTV NPGFAAKTPG NTLVVPITAG
SFAGSDNVGG SGISGYLITT SATQPAANDP GWTPTAPATF TVAAAGTYTL YPWVKDGGNN
VSALYVNPVT VVVDMTAPTV ATFTVNAQTN SRNIPIGGIT GTDVGTTVAA YLITSTSTQP
AAGAAGWSGT VPATFTVAAD GTYTLYPWVK DAAGNVSAVS GVTRSVLVDT VAPTGLSAVS
PADASTDQAF NSTLQSSAAT DAGVGNISYY FEITDGAAYT ANSGWIASTS WAPAGLVAGT
TYTWTVRARD GVLNQVSATP RTFTTAAACV RNDPTVTLLT PTGGVASTIS VDGGTSVYNL
KVVNNDYGGC GSTTFNLGVS DTDVGDVFDP PTLAVPSVTL ATRAQTTTTV TVKATVDHLS
GAGKTRAFTL ADASHAQVTT GDVQTTLNVV SCTKKTPLLI VGPDSGYLNR GGSMKYTVTV
KNTDSGTGCA PVTYNLAIPS ETNTTDFNAS SFSAPSIVLN SGQIGSVTLT VSAKATAAKN
VVNKTTVAAS AALHTSPANV VVSSTVNNPM LHNSDSTSST RWAADGGWGI PNARYGEFDC
TTCHVQGGAS TRNVNRIRET VSAPDAGKGE LPGAGQAINY RRTAGTSATQ PVPGWDSGAT
PRTSSEKICE VCHTYDATGA NGTKAHPFAT TATLGNHFGT DGRDCIDCHK HGKGFSVNGM
GCTGCHGTET ETITPDNRYA VAPPRNASGL SGTVSGAGNV SNDPKVGAHQ THLRGTIGFS
NYSTVDYRCE ACHGPIPTNF SHMNSNALPA FQGLATKNGA MSATYSGTNC NNTYCHNPAG
TGGTLLAVNA GTAVFPSWTS ANYVAGGAKS VANCNVCHKV PGDTGFQPAG THSGMNTDST
DCAGCHGHNG DATGTVAGKR HMDGIKWGVG NCDSCHGYGP GTWASMPERS GVAEGKGAHE
KHIAYLMQRN NATLTATSDA FGSGTPWTAV CGVCHNGAAH DMGEAIPGTG RTISITPARN
FGGTAVYNGI VGQSSQFKAK TCSNVDCHYK ETPVWSSY
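
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, together with a GC-fraction helper and a basic check for a terminal stop codon. The helper names are hypothetical, and the stop-codon set is the standard one for translation table 11 (TAA, TAG, TGA). Only the first and last printed lines of the gene sequence are used as a short demonstration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First and last printed lines of the gene sequence, used only as a demo.
first_line = clean_sequence(
    "ATGGGAAAAG TGATCGTTAA AAATATCAGG GGCATGGGAT GGGGTGCGAG AACCTGCCTG"
)
last_line = clean_sequence(
    "ACCTGCTCCA ACGTCGATTG CCACTATAAA GAAACGCCAG TCTGGTCGTC GTACTAA"
)

print(len(first_line), len(last_line))                              # 60 57
print(first_line.startswith("ATG"), last_line[-3:] in STOP_CODONS)  # True True
```

Passing the full joined gene sequence to `gc_fraction` would give a value to compare against the GC content field in the record.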