Gene GM21_1198 details

Gene Information

Locus tag: GM21_1198
Symbol:
ID: 8136523
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1399323
End bp: 1402754
Gene Length: 3432 bp
Protein Length: 1143 aa
Translation table: 11
GC content: 63%
IMG OID: 644868812
Product: cytochrome C family protein
Protein accession: YP_003021017
Protein GI: 253699828
COG category:
COG ID:
TIGRFAM ID: [TIGR01904] Geobacter sulfurreducens CxxxxCH...CXXCH domain
            [TIGR01905] doubled CXXCH domain
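
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `check_record` is hypothetical, and it assumes that coordinates are 1-based and inclusive and that the gene ends in a single untranslated stop codon.

```python
def check_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a gene record.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and exactly one stop codon is not translated.
    """
    span_bp = end_bp - start_bp + 1          # inclusive span on the replicon
    codons = span_bp // 3                    # whole codons in the span
    return {
        "span_matches_gene_length": span_bp == gene_length_bp,
        "length_is_multiple_of_3": span_bp % 3 == 0,
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields of this record.
result = check_record(1399323, 1402754, 3432, 1143)
for name, ok in result.items():
    print(name, ok)
```

For this record all three checks pass: 1402754 - 1399323 + 1 = 3432 bp, and 3432 / 3 = 1144 codons, one of which is the stop codon, leaving 1143 aa.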


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 169
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAGCT GCCGCAGTCT GAAATGCTGC AACCTGATCT TGAGTTTGAC GATGCTGTTC 
CTTGTTATGG GCATTCTCTC CGTCTCCGGT CAACCGGCGC ATGCTTCAAC GCGGCAGTAC
GTCATGACCT GCATCTCCTG CCATAAAATG CCTCCCCTCG ACTCAGCCGA CGGGACCAGG
ATTCCGTACA CGGGCGCCTT AAAAGGGAGC CATTTGGGGC ATGCGTCGGC CTCAACCTCC
TCCTGCGCTA AATGCCATCG CGACGATGTC GCCAACTACC GTACCGCCCA TCGAAACCGT
TTGATCGAAA TTTCCCCCGC AATCAACAGC GTGGCCGGCG CCGCCTACAG TCGCGGCTTC
TTCAACCAGA CCTCGGTTCC ACCGGCCATC CTCGGGACCT GCTCCAGCGT CGACTGCCAC
TTCGAGTCGA CTACCCCTTC TTGGGGAACT ACGGTGCTCA AAGCTCCCGA AGACTGCTCG
GCCTGCCACG GCTCCGCGCC TGCCGACGGG AACCACCCGG GGTCCGGGCA AAAGCACGGC
GTCTACTACG ACACCGGCAC CGGCTCCTGC GCCGTATGCC ATCCCGATCA TCTTGGGGAC
GCGAAGCCCT TCGCCCACGC CACCAGCGCA GGGAGCCGGG CCCTCGCGGT TCAGTTCACC
ACCCTCCCGA ACAAGGGGGG GAGCTACTCG CGAGACCTGA GCTATCCCAA TTACCTCCCC
AGTAAATCCG GGTTGCGCAA CGGCAGCTGC CTTGGGCTGT ACTGCCACAG TCCGGGGAAC
AAGAACAGCA GCTTCGATCC CCCGAATCAG GCCGCAACCT GGGGGGGCAC CCTCAACTGC
GCCGGCTGCC ACAAGGCGGG CCTCGCCTCC GGCAGTGTCA TGACCAGCGG CAGTCACGGC
AAGCACGTCG ACGGGTCGGT CTCTTCCTTT GCCTGTTCCA AATGCCACTT TGCCACCGCC
ACTAGTTCGA TGACCATCGC CGACGTGACG CAGCACGTAA ACGGACGCGT CGACATCTTC
TTCGGCGTAA GCACGAGCGC TGCCAACGGC TCGTACAACG GTCTCATCTC GCCGGTATCT
AAACTCCCCG GCAGCGGATA CGGCGCCTGC TCCAACGTCT ACTGCCACTC CAACGGACAA
AGCGAAGGCG GCGTAGGCAT CGCCTACCGC ACACCAATCT GGGGCAGCGG CACCACCGGC
AAATGCGGCT CCTGCCACGC AGACGGCAGC GGCCATAACG ACGCCGTTCC CGCCATGTCC
AGCGGCAGCC ATAAAAAGCA CTTGTCCTAC ACCCTGCTTG CCACCAGCGG CCCGGTCCGT
TGCACCATCT GCCACAACGT TAAAGGCGCG AAATTCACCG CGTATGCGTC GTGCAGCCAG
ATGAGCTGTC ACTCCACCGG CGGGGCAATC AAGCACTCCG ACCAGGAGAT CGACGTGAGC
CTGGTTAGTT ACTTCGGCGG GGTCTACGAC GGCACACGGG CTCCAGGCGA CGGCTACGGC
GCCTGCGCCA ACGTCTACTG CCACAGTAAC GGGCAGGCCA CCCCGAGCTA CGCCCCTCCC
GTCACCTGGG GCGCCGTGAC CTTGCGCTGC GATGCGTGCC ACGGCTCCGC CACCAGCAAG
GGGGGAAGCG ACACCACGAC GTCTCTTTCC GGCAAGCACG CCGCGCACGT GAACAACGCT
TTGGTGCTCG GCGCCGGCAA AAGCCTGCAC TGCATCGACT GCCACAGCAT CACCGTCAGC
AGCGACACGA CCATCGCCTC GACCGCGGTC CACGTCAACA AGATGCTCAA CTACACGGGG
CATTATGCCG GCGGTCCCAG GCGCTACAGC AGCACCACGA AAGTCTGCTC CAACATCTAC
TGCCACAGCT CCGGCCAGGC AAAGCCGGTG TTCCGTAACA TGACCGGCAC CAAGTCCTGG
GCCTCGACCG GCACTCTTTC CTGCAACGGT TGCCACGGCT ACGGACCGGG CACCTTCGCC
TCGGTGGCGG GCGAGCCGAA CTACCTGAAC GGCGGCGCCG GTTCGGGCAC CGCAAACTCG
CACCAGAAAC ATATGGCGGG GGCGAATCTG CTGGATTCGC GCGGTTGTGC CAAGTGCCAC
CGCAGTACCG CGGACCAGGG GATGGCCGGG AAGCTGCGCG ACTACAGTTC GGCGCACCTC
AACGGCTCCC GCGACGTGAG CTTCGCCGTG CTCGGCAATA TCTCAGGTCA CTACAGTGCA
GCCGCCAAGA CCTGCTCCAA TACCTACTGC CACGGGGGGG GCTCCATCCA GTGGGGGGGG
CAAGGGCCGC TCGCCTGCAA CAGTTGCCAC GGTGACGCCG AGACGCTCGG CACCAACGCC
CACGCCCGCC ACATAAGCCC GAGCTCCGGC AAGGCGATCT CCTGCGCCAT CTGTCATGCC
GCGACCGCAG CCGGCAACGG CTCGATCGCA GACGGCACCA TTCATGCCGA TGGCAAAAAG
GACGTCGTCT TCTCCGGCGC GGCCCTAGGG ACACAGATGG ACCTCACGGG CAACTGCTCG
ACGAGTTACT GCCACAGTAA CGGCAAGGGA AGTTACTCCA CCCCTAACTG GTCCGCGAAC
TCTTCCGGCG CTTGCGGTAC CTGCCATGCG ACGGCGCCCG GGCTCGGCAG CCCCCTCATC
GCAAGCGGCG CCCATTTCAG CCACTTCAGC ACAGCCGCTA CCAGCTATGG TCCAATGTTC
AGCACGGGTA ACGTGACCGG CTGCCAGGCT TGTCACGACT TCGGCAACGA GTTGGCTTCC
ACTCACATCG ACCAGACGGT GAATGTGAAC AGCTCGCTCG GGTATTCCAC TAGCGGCACC
TGCACTCCCT GCCACACCAA GGAAGTTAGC TGGACCGGGG GAGCCGTCTC CTGCGAGAGC
TGCCACGCCG GCACACTCTC CGTGATAAAT GGCGTCACCG CCAGCGACAA GAGTCAGGCT
GCCACGCGCG GCCACGGCGG CCCGACGATC GGGAAGGGAT GTACCGACTG CCACGAACGC
AACGCGCGGC ACATAAACGG CGGCTCCCGC CTCCGGGCGC AATTTTCCGG CGGGCTGAAC
CTTGAGTGCA ACTACTGCCA TGACGACTCC TCAGTTTTGC TGGACCCGGA CTCCCGGAAC
ATGAGTACCC ATGTTCTGGT CAAGGGGGGG ACCCCGGCGA TGGAGTGCGC CCAATGCCAT
GACCCGCACG GCTCGAATAA TCTCAAGATG ATCAGAGCCG TCATCAACGG CAAGGAGATC
GTCTTCAACG ACATGATGAA CGGCCTGATC GACACCGTGA CCAACCAGGG CATTTGCCAG
GTCTGCCACA CTCAGACCTC CCACTACCGC GCCGGAATTC CCGAAACCGA TCACCCGACC
TCGGGTTGCC TCTCCTGCCA CCCGCATGTC GGAGCCGAGG CCGCGTTCCT TCCACAGTCT
CGGCGATACT AA
 
Protein sequence
MNSCRSLKCC NLILSLTMLF LVMGILSVSG QPAHASTRQY VMTCISCHKM PPLDSADGTR 
IPYTGALKGS HLGHASASTS SCAKCHRDDV ANYRTAHRNR LIEISPAINS VAGAAYSRGF
FNQTSVPPAI LGTCSSVDCH FESTTPSWGT TVLKAPEDCS ACHGSAPADG NHPGSGQKHG
VYYDTGTGSC AVCHPDHLGD AKPFAHATSA GSRALAVQFT TLPNKGGSYS RDLSYPNYLP
SKSGLRNGSC LGLYCHSPGN KNSSFDPPNQ AATWGGTLNC AGCHKAGLAS GSVMTSGSHG
KHVDGSVSSF ACSKCHFATA TSSMTIADVT QHVNGRVDIF FGVSTSAANG SYNGLISPVS
KLPGSGYGAC SNVYCHSNGQ SEGGVGIAYR TPIWGSGTTG KCGSCHADGS GHNDAVPAMS
SGSHKKHLSY TLLATSGPVR CTICHNVKGA KFTAYASCSQ MSCHSTGGAI KHSDQEIDVS
LVSYFGGVYD GTRAPGDGYG ACANVYCHSN GQATPSYAPP VTWGAVTLRC DACHGSATSK
GGSDTTTSLS GKHAAHVNNA LVLGAGKSLH CIDCHSITVS SDTTIASTAV HVNKMLNYTG
HYAGGPRRYS STTKVCSNIY CHSSGQAKPV FRNMTGTKSW ASTGTLSCNG CHGYGPGTFA
SVAGEPNYLN GGAGSGTANS HQKHMAGANL LDSRGCAKCH RSTADQGMAG KLRDYSSAHL
NGSRDVSFAV LGNISGHYSA AAKTCSNTYC HGGGSIQWGG QGPLACNSCH GDAETLGTNA
HARHISPSSG KAISCAICHA ATAAGNGSIA DGTIHADGKK DVVFSGAALG TQMDLTGNCS
TSYCHSNGKG SYSTPNWSAN SSGACGTCHA TAPGLGSPLI ASGAHFSHFS TAATSYGPMF
STGNVTGCQA CHDFGNELAS THIDQTVNVN SSLGYSTSGT CTPCHTKEVS WTGGAVSCES
CHAGTLSVIN GVTASDKSQA ATRGHGGPTI GKGCTDCHER NARHINGGSR LRAQFSGGLN
LECNYCHDDS SVLLDPDSRN MSTHVLVKGG TPAMECAQCH DPHGSNNLKM IRAVINGKEI
VFNDMMNGLI DTVTNQGICQ VCHTQTSHYR AGIPETDHPT SGCLSCHPHV GAEAAFLPQS
RRY
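
The TIGRFAM annotations in the Gene Information section refer to CXXCH domains. A minimal sketch of how such motifs could be located in the protein sequence is shown below. The helper name `find_cxxch` is hypothetical, and a plain pattern match is only a rough stand-in for the TIGRFAM models, which are profile-based and not reproduced here.

```python
import re


def find_cxxch(protein):
    """Return 1-based start positions of C-x-x-C-H matches.

    Whitespace is removed first so that sequences printed in blocks of
    ten residues can be pasted in directly. A lookahead is used so that
    overlapping matches are also reported.
    """
    seq = "".join(protein.split()).upper()
    return [m.start() + 1 for m in re.finditer(r"(?=C..CH)", seq)]


# First line of the protein sequence from this record.
fragment = "MNSCRSLKCC NLILSLTMLF LVMGILSVSG QPAHASTRQY VMTCISCHKM PPLDSADGTR"
print(find_cxxch(fragment))
```

Passing the full protein sequence instead of the fragment lists every candidate site in the 1143 aa product.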