Gene GM21_1193 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1193 
Symbol 
ID8136518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1385563 
End bp1388619 
Gene Length3057 bp 
Protein Length1018 aa 
Translation table11 
GC content65% 
IMG OID644868807 
Productcytochrome C family protein 
Protein accessionYP_003021012 
Protein GI253699823 
COG category 
COG ID 
TIGRFAM ID[TIGR01904] Geobacter sulfurreducens CxxxxCH...CXXCH domain
[TIGR01905] doubled CXXCH domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones106 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAG TCATAGCGGC GTTTTGGTTA ATACTGCTGG CGATCTCGAC AGCCTCTGCA 
ATCGAGTCGC CGCACATATC CAATATGTCC TGCTCGGGGA CCATGGGGTG TCACACCATA
GCGCTCATCG ACGGCGTCTA CGTAGCCACG CTGGTAGACG CCTACGGTGT CAACAACCTC
TGTATCAAGT GCCACAACCC GCTGACCATG GTGCGCGGAT TCTCGGCCAG GGACATAGCG
AACCCGTTCG GGTCCACCGA CACGGGGCTC TTTCCGTCCG AGCAGTTCCA GAGCTCCCAT
AACTGGGCCG GACCGGTCCA TATGCCTGCC GCAGGCGCCC AAATCTCGAC AGATCCGGCG
ATAATCGCAC TTAGGCCCAC GAAGGCGGGA GTCCTCGGTT GTACCAGCTG CCATAATCAC
CACTCCTTCA CCGGAGACCT GTTGCTGAGG CGCCCCGGCG ATACCCTCTG CCTCGATTGC
CACCGGCAGC GCAACCAGCG CGACGTCCAA AGCGGCACCC ATCCGGTCAA CTTCAACTAC
ACCAGCGCAA CCTCGAAGGT GAAGCTCACC CCTGCGAAGT TCTTCGAGGA GCCCGTCAAC
GCCAATCCGG CGAACCCCAC TTCCGCCATG CTGCTGCCGG GAGGGAAACT TGTCTGCACC
ACCTGCCACT CCCCTCACTA CGCCGACTCC AACGCGCGGA CCTTCGACAA CGCCTCCTCC
TCCACCTTCG GGCTCCTTTC TTCCTCGCGA GGGAGGCTTT TGCGCACCGA CGTGAGGGGG
GGGAGCGCCG ATGAGGTGAA CATCTGTACC AACTGCCACG CCGGGAAACG CGCCCATAAC
GGCAAGGGAC AGAACATCCA GTGCGCCGAC TGCCACGCAG CCCACGTCGA CCCCCTGGAC
GGCACCGCCC CCAACGTCTG GCTGGTGCGC CGCTACATGA ACCCGCAGAC CACCAACGTC
AGGGTCGTCA ACCAGTACAA GGACGCAGGG AGCAACTGGG CCGGCCCCGA CGGCGTCTGC
GTCGCCTGCC ACGCCATACC GGCTCCGGGA GGGAACTACC CCCCCGAACA CGCAAGCACC
GACCCCAACG TCTGCCGCAG CTGCCACATT CACGACAGCG CCGACGGCTC CTTCGCCGCC
GGATGCAACT CCTGCCACGG CAACCCGCCG CAGACAAACG CGGCCGGGCC GAGCGGCTAC
GCCAGCAGCG GGAGTTACAA CTACGCCACC AGCGGCGTCT TCAAGGACGA GTCGCTCACC
CCGCACCTGG CACACACCAG CCGGGGATTG ACCTGCGCCG CCTGCCACTC CGGCAACCAG
CACGCAAGCG GCGACTTCCA GCAGGTATTC CGGACTCCTT CCGGCACGGC GACCTACTAC
GGCGCCGCCC CAAGCTACGA CCCGGCAGGC CCGGGGAGCT GCCTCACCAA CTACTGCCAC
AGTAACGGTG CGCCTTCCAG TCTCCCCCCG GTGTACAAGA CCGCGACCTG GGCCCTGGGC
AACAACAGCA TCGTCGGGAC CCCCGAAGAG TGCTCCGACT GCCACGACGC GGCCCCCGAT
ACCAACGCCC ACACGGGCCA CCTTTCGCGC GGCTACAGCT GCACGGTCTG CCACGCCGCG
ACCGCAGCCT CCAACAGCAG CATCAAGGAC GCCTCGAAGC ACGTGAACGG GATCAAGGAA
ATGGCCTTCA TCGGCGCCGC CCTCGGTACC GAGATCGACG TCTCCGACAC CTGCACCACG
AGCTACTGCC ACTCGAACGG CAGAGGGGTC TACTCCTCCC CGAACTGGAC CATGAAGGCC
ACCGGCGCGT GCGGCACCTG CCATGCCACG GCCCCCGGTC TGGGGAGCCA GCTCATCGCA
AGCGGCGCCC ATTTCGCCCA CTTCAGCACC TCGTCCGCGG CCTACGGGCC GATGCTGACC
ACGCAAAACT CGACCGGCTG CCAGGCCTGC CACAACGTGA GCAGCGCCAA CCACGTCAAT
CAGAGCATCG ATCTCAACGG CTCGCTCGGC TACCTGGGCA ACGGTACCGG TACCTGCACC
CCCTGCCATC CGACGCAGGT GAACTGGAGC ACCGGAGCCG TCACCTGCGA GAGCTGCCAC
ACCGGAACCG TCTCGGTGAT AAACGAGGTC CCCGCGCCGA ACAAGAGCCT GGCGGCGACT
GCCGGCCACG GCGCCCCGGC GCTAGGGAAG GGGTGCACCG CCTGCCACGA GCGCAACGCG
CGGCACATAA ACGGCGGCAG CCGCCTCCAG GCGCAATTGA GCGGCAGCCT CAACGCCGAC
TGCAGATACT GCCACGACAA CGCCTCGGAG GTGGTCACCG AAGGCTTCCG GAACATGAGC
ACCCACTTCC TGACCAAGGG GGGGAGCCAG GCCATGGCCT GCGCTAAGTG CCACGACCCG
CACGGCTCGA CCAACCTGCA CATGATCAAA ACGCTCATCA ACGGCCAGGC CATCGTGTTC
AACGACGCGG TTAACGGCCT GGTGAACACC ACGACCAACC AGGGGCTTTG CCAGGTCTGC
CACACCCAGA CCGCCCACTA CCGCGCCGGC GTGCCGGAGA CCTCGCATCC GACGACAAAC
TGCCTCTCCT GCCACGACCA CCGGGCAGCC GGAGGCGCCT TCAAGCCAGC CGGAACCTGC
GACGCCTGCC ACGGCTACCC CCCCGCACCC AAGGCGACCA TAACCCCGCA GCTCTTCGGC
GTGATGGGTA GCTGGTCTTC GGCGCGCTAC GAGGATTACT CCGGAGGCGG CGGCGCCCAC
CTGGTCGCCG CTCATGTTTC CCCAAATGCC AAGCCGAGCG AGGGGTGGAG CAACTGCGCC
ATCTGCCACA GCGGCGGCTC CACGGGCGAC TCCGGCAACC ATAAGATGAC GATGCCGCTG
AAAGGGCACA TCGAGAACGT CGACCTGGTC GTGGATAAGA GGTTCCGCTT CGCCAACAGT
TTCATCGTCT ACACCGGGTC GCAAAGGGTC AGCGCTCCGG CGCAGAACGC GACCGGGAGC
TGCTACAACG TGAGCTGCCA CATTACCAAG TCACGGCGCT GGAGTATCGA GAGGTAA
 
Protein sequence
MKRVIAAFWL ILLAISTASA IESPHISNMS CSGTMGCHTI ALIDGVYVAT LVDAYGVNNL 
CIKCHNPLTM VRGFSARDIA NPFGSTDTGL FPSEQFQSSH NWAGPVHMPA AGAQISTDPA
IIALRPTKAG VLGCTSCHNH HSFTGDLLLR RPGDTLCLDC HRQRNQRDVQ SGTHPVNFNY
TSATSKVKLT PAKFFEEPVN ANPANPTSAM LLPGGKLVCT TCHSPHYADS NARTFDNASS
STFGLLSSSR GRLLRTDVRG GSADEVNICT NCHAGKRAHN GKGQNIQCAD CHAAHVDPLD
GTAPNVWLVR RYMNPQTTNV RVVNQYKDAG SNWAGPDGVC VACHAIPAPG GNYPPEHAST
DPNVCRSCHI HDSADGSFAA GCNSCHGNPP QTNAAGPSGY ASSGSYNYAT SGVFKDESLT
PHLAHTSRGL TCAACHSGNQ HASGDFQQVF RTPSGTATYY GAAPSYDPAG PGSCLTNYCH
SNGAPSSLPP VYKTATWALG NNSIVGTPEE CSDCHDAAPD TNAHTGHLSR GYSCTVCHAA
TAASNSSIKD ASKHVNGIKE MAFIGAALGT EIDVSDTCTT SYCHSNGRGV YSSPNWTMKA
TGACGTCHAT APGLGSQLIA SGAHFAHFST SSAAYGPMLT TQNSTGCQAC HNVSSANHVN
QSIDLNGSLG YLGNGTGTCT PCHPTQVNWS TGAVTCESCH TGTVSVINEV PAPNKSLAAT
AGHGAPALGK GCTACHERNA RHINGGSRLQ AQLSGSLNAD CRYCHDNASE VVTEGFRNMS
THFLTKGGSQ AMACAKCHDP HGSTNLHMIK TLINGQAIVF NDAVNGLVNT TTNQGLCQVC
HTQTAHYRAG VPETSHPTTN CLSCHDHRAA GGAFKPAGTC DACHGYPPAP KATITPQLFG
VMGSWSSARY EDYSGGGGAH LVAAHVSPNA KPSEGWSNCA ICHSGGSTGD SGNHKMTMPL
KGHIENVDLV VDKRFRFANS FIVYTGSQRV SAPAQNATGS CYNVSCHITK SRRWSIER