Gene GM21_3540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3540 
Symbol 
ID8138912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4086349 
End bp4090218 
Gene Length3870 bp 
Protein Length1289 aa 
Translation table11 
GC content65% 
IMG OID644871159 
Producthypothetical protein 
Protein accessionYP_003023319 
Protein GI253702130 
COG category 
COG ID 
TIGRFAM ID[TIGR01904] Geobacter sulfurreducens CxxxxCH...CXXCH domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones138 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAAGA TAATCAGGAA AAAAATCCAT GGGATGAGCT GGTGGTCGAA AGCCAGCCTT 
GTTCTCCTCT TCACCTTGGT GACATCCGTC TTCATGTACC AGGGGTGGTA TAAGCCGATC
CAGGCTGCGG CCGGCGTTTC CTATCTCGGC ATGTCGTCCG CCACCGGAAC CTCCACGACC
CTTTCGGTCG CGGCGCCGGC CGGCCTGCAG GCAGGCGACC TGATGGTCGT CAGCATCACC
AACCGGACCA CGTCCACGAC GCTTCCCACC ATGGGGTCGA CAGGGTGGAC CCAGCTCCAT
CCGCTTGTCT TGCGCACAAG TTCGACCTAC AGGCGCAGCA ACAGCGCCTA CAAGGTGGCC
ACGGCCGCGG ACGTCGGCGC CACTTACAGC GTTTACGGCG GCTCCGGAAC CATCGCGGCG
AACATCTACG CCTTCCGCGG CGTAGACCCC TCGAACCCGA TAGACGCAAG CAGCGGGAAA
GCCAATACCA CGGCATCCTC CACTTTGTCG GCCAATACCA TCACCACCAC GGCGGCCAAC
GATGTCGTCG TGTTCCTGGG GCAGGTGGCC GAAACCACCT CGGTCAACTT CAGCAACTGG
GCGACGGCCA ACATAGCTGC CGCCAACTGG ACCGCCATCG ACAACACCGG CTCAACCAGC
GGCTCGAACG GCGCGGCATA TGCCACCAAG GCCGCAGCGG GGGTCGTAGG AGCCGGAACG
GCCACCCTCA GCACCAGCTA CATAAACGGC GGCGTGCTGA TCGCACTGAG ACCCATAGCC
ATCGTGAATC CTGCTGTCAC CGGACTGTCG ACGAGCAGCA TGACCCAGGG GAGCGGACCC
ACCACCGTGA CCATCACCGG TACCGGCTTC CAGGACGGCG CGACCGTCGC AGTCAGCGGC
ACCGGTGTCA CTGCGGGGAC CGTGTCGTAC GTTAGTGCAA CCTCCCTTAC CGTCCCTCTC
ACCGTCGCGG CCGACGCCGC CGCAGGGGCG AGGACGCTCA CGGTCACCAA CCCGGACACC
GGTTTCGGAA CCACCACCTT CACCGTTAAC AGCGCGCCGG CCCCGACCGT CCTTTCCACC
AGCCCGGCGT TCATGATTCA GGGGGCAGGG CCGACCACCG TCACCATCGC AGGCTCCAAC
TTCCAAAGCG GCTCGACGGT CGCCTTCAGC GGCACAGGCA TCACGCTCGG CGCGGTCACC
TACGTAGATC CCGCCACCCT CACCGTCCCG GTGACGGTCG CCGTCGGGGC GACTCTCGGC
GGGCGCGACG TGACGGTGAC CCTGCCCGGC GGGGTTAGCG GCACCGGAAC CGCCCTGTTC
ACCGTCAACT CCCCCTGCGC GGCGGGGCAG CCCACGGCAC TCAATCACGG CGCCGAGACC
ACCAACTCCG TTCCCCTTAC CTGGACCCCG GGCGCCAACA CGAACTACTC CATCGTCTTC
CGCGACGGAG TCCAGATAGC CACGAACGTC CCGGCCGGCT CCTACACGGA CAATACCGTC
AGCCCGGGCA CCTCCTACAG CTACACGGTA ACCGGCTACA ACACGGCCGG AAGCTGCCAG
AGCGCGCCTT CCGCCGCGAC CACGGCCGTC ACCCTGGCAC AGGCGCCGGT AGCGCCGGCG
GTATCCAACG TAGGGAGCGG CACCCAGCTT CTCGTCGCGG TCAACCCCGA CGCGAACTCC
GCCACGACCC AGTACGCCAT CAGGATCAAC GGCGGGGCCT ACGCCAACCA GTACGTCCAA
TCCGGCGGCA CCGTCGGGGC AACCGCCGCC TGGCTCACCC AGGCGGGCTG GGGCGCCAAG
ACCATTTCCG GCCTGACCAG CGGCACCCCC TATACCTTCG ACGTCAAGGC GCGCAACGCC
GCCCTCGTGG AAACCGGGTT CGGCCCCAGC TCGGCCGCCA CACCGAAGAT CGCCCTTTCC
TCGAACATCA CCAGCTGCGC AGGGTGCCAC GGCAATCCTC CGGCCGACGG CACCGGCCGG
AACGTTCCCG CCGGCCAGTT CAAGGGCTCG CACGCGAAAC ACACCGGCAT CGTGGCCTGC
TCCGCCTGCC ACAAGGACAA CGGCGTCTCG CCTGCCGGCA ACAAGCATTC CGACGGCATC
ATCGACATCG CCAGTCCTCT GCGCCAGATA GCGGGCGAAA GCTATGGCCA AACGAGCCAT
GCGGTCTCGG AAACACCGAC CTTCGCCTCC TGCACCACCT ACTGCCACAG CCAGGGGACC
AGCAAGACCT CGCAGCCGAG CGAGAGCCGC ACCACGTTGT CGGCGCCCGC CGTGACCCTC
GTCTGGGGAA CCGGAACCAG CACCTGCGCC TCGTGCCATG GCGCTCCCCC CGCCTATGCA
AACGGGGCGA CAACCTGGGG GGCGGCTAAG GCCAACGCCC ATGGTGGAGC AACGCATGCC
TCCAAGACCT GCGATTACTG CCATACCAGC GTGACCTACA GCGGCGGGGT GTACACCGCC
AATACGCAGC ACGCAAACGG CGCCTACGAC ATCCAGCCTT CCTTCGGCTA CACCTACGCC
GCCACCGGCG GCACCTGCGC CACCTCCGGT TGTCACGTCA GCATTGCCTG GAACGGGAAG
CTTGGTTGCA TCGATTGCCA CTCCGCCCCC ATCACGCGCA AATACGGGCG CCCGAACGCC
CAATTGGCGG GGGTCACAAA CGAGTTCGGC AAGGTCTGGG GGCACAAAAA GCAGGGACGC
GGCGCCGTGA CCGACGCTGA CTGCATCGTC TGCCACCTCG AGGGTAGCTT CAGCACCCAG
AAGGCGAGCA CAACCTACCA CCAAAACGGC AACATCGACC TTCGCGATCC GGACGTTGAG
GGGGAGACCC CGATCACCAA CGTTTCCGGG TTGAATGGCG GGGCCTTCAC CTTCCAGCGC
TTCACCACCA GCTACGTTGC GGGAACCAGG TTCACCGACG GGCATACCCG CGACACCGTA
GACAACGTCA TCACCCAGAA ATTCTGCCTC AAGTGCCATG ACAGCAACGG CGCCCTCAAT
ACGACGGCCC GCTCGGGAGT ATCGCCGACT CAGTACCTCC CCTTCGGCAC CGGTGCGCTC
AACAACGTCA CCTACTACAC GCTCGGTCTG AGCGCAGGCG TTGCCGGCGG GATTGTCAAT
GTTGACAGCC AATTCGCCGT CACCAACTCC TCGCGCCACC CGGTCAAGGG ACCGTTTTCC
GCAGGGTTTC CGGGCACCGG CAGGCTCGCC GCGCCGTACA ACAACTTCAC CCGTACCGCC
GGCACGCTCG CCAACAGCGT GGTGCTGAAC TGCTTCGACT GCCACAACCA GCCGTCGCCG
AATCCGCTTC TGACCACCCG CACGGTTACC GCACACGGCA ACGCGGATAC CATTCGCGGC
ACGGCGACCA TCGCTCCTGC GACGACACCG TCGACCACCA ACCGGGTGAC CCTCTGCGTC
ATCTGCCATA CGGGTTACGA CAGCAACACC GGGCAGGACC ACGGCACGTC CGGCACCGGC
ACGACAGCTA CCGCGGTCTC GGGCGGCGTG CTGGACCGGT CGGAGAAGGT GAGCTTCCTG
CGCTATGGCT GCAACGTCTG CCACAGCAGC GGGCTTAACC CGGCACGCCA GCGCCCGATC
CGAGGCGAAG ACGCCCACGG CTTCAACGCG CTGCGTGACC AGAGCGGCTC GACCAACCTC
ACGCCTACGT CGAGGTGGGC CACCGGCGCC AACAAGCGGC CGTACGCCTT CATACGCAAC
GCCACTTACT TACGTAATCA CAGCCCGAAA TCGATAGCCG GGGTATCCTA CTCCGCCGAG
TGCGACATGG GCGGAGCCAA CGGCGACGGC AACAGTATCT GCAAGAGCAT CGGGGCAGAA
ACCTACTCCA CCAGTGGTGG CGTCTACTAG
 
Protein sequence
MGKIIRKKIH GMSWWSKASL VLLFTLVTSV FMYQGWYKPI QAAAGVSYLG MSSATGTSTT 
LSVAAPAGLQ AGDLMVVSIT NRTTSTTLPT MGSTGWTQLH PLVLRTSSTY RRSNSAYKVA
TAADVGATYS VYGGSGTIAA NIYAFRGVDP SNPIDASSGK ANTTASSTLS ANTITTTAAN
DVVVFLGQVA ETTSVNFSNW ATANIAAANW TAIDNTGSTS GSNGAAYATK AAAGVVGAGT
ATLSTSYING GVLIALRPIA IVNPAVTGLS TSSMTQGSGP TTVTITGTGF QDGATVAVSG
TGVTAGTVSY VSATSLTVPL TVAADAAAGA RTLTVTNPDT GFGTTTFTVN SAPAPTVLST
SPAFMIQGAG PTTVTIAGSN FQSGSTVAFS GTGITLGAVT YVDPATLTVP VTVAVGATLG
GRDVTVTLPG GVSGTGTALF TVNSPCAAGQ PTALNHGAET TNSVPLTWTP GANTNYSIVF
RDGVQIATNV PAGSYTDNTV SPGTSYSYTV TGYNTAGSCQ SAPSAATTAV TLAQAPVAPA
VSNVGSGTQL LVAVNPDANS ATTQYAIRIN GGAYANQYVQ SGGTVGATAA WLTQAGWGAK
TISGLTSGTP YTFDVKARNA ALVETGFGPS SAATPKIALS SNITSCAGCH GNPPADGTGR
NVPAGQFKGS HAKHTGIVAC SACHKDNGVS PAGNKHSDGI IDIASPLRQI AGESYGQTSH
AVSETPTFAS CTTYCHSQGT SKTSQPSESR TTLSAPAVTL VWGTGTSTCA SCHGAPPAYA
NGATTWGAAK ANAHGGATHA SKTCDYCHTS VTYSGGVYTA NTQHANGAYD IQPSFGYTYA
ATGGTCATSG CHVSIAWNGK LGCIDCHSAP ITRKYGRPNA QLAGVTNEFG KVWGHKKQGR
GAVTDADCIV CHLEGSFSTQ KASTTYHQNG NIDLRDPDVE GETPITNVSG LNGGAFTFQR
FTTSYVAGTR FTDGHTRDTV DNVITQKFCL KCHDSNGALN TTARSGVSPT QYLPFGTGAL
NNVTYYTLGL SAGVAGGIVN VDSQFAVTNS SRHPVKGPFS AGFPGTGRLA APYNNFTRTA
GTLANSVVLN CFDCHNQPSP NPLLTTRTVT AHGNADTIRG TATIAPATTP STTNRVTLCV
ICHTGYDSNT GQDHGTSGTG TTATAVSGGV LDRSEKVSFL RYGCNVCHSS GLNPARQRPI
RGEDAHGFNA LRDQSGSTNL TPTSRWATGA NKRPYAFIRN ATYLRNHSPK SIAGVSYSAE
CDMGGANGDG NSICKSIGAE TYSTSGGVY