Gene GM21_3544 details


Gene Information

Locus tag: GM21_3544
Symbol:
ID: 8138916
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4102550
End bp: 4105870
Gene Length: 3321 bp
Protein Length: 1106 aa
Translation table: 11
GC content: 64%
IMG OID: 644871163
Product: cytochrome C family protein
Protein accession: YP_003023323
Protein GI: 253702134
COG category:
COG ID:
TIGRFAM ID: [TIGR01905] doubled CXXCH domain
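
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 4102550
END_BP = 4105870
GENE_LENGTH_BP = 3321
PROTEIN_LENGTH_AA = 1106


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in a single stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for this record.
print(computed_length == GENE_LENGTH_BP, computed_protein == PROTEIN_LENGTH_AA)
```

The length does not depend on the strand, so the check works even though the Strand field is empty in this record.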


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 143
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTAGAA AATTGCTCAT GATACTTGTG TTGCTGCTGG CTATGCCGGC CTTTGCCAAC 
GCATGGTATG TCAATGCGAA GACTTCGCCG ACCAGCGGTG CGGGCACCAT CACCCCGTCG
GGCAACAGGA CCTATGCCGC GGGGGTGAAC AGCGAGGAGT TCACCGTCAC TCCGGCTCCG
GGCTACACCC TTTCCCGCGT CACGCTCGAC GGCGTCGCCA TCGCCCCCAA CGCAAACGGG
AAGTACGTCG CACCTTACGT CTCGACCCTG ACCTGGCGCT ACATGGTGGC AGTCTTCTCG
GCCGGCACGG TGAACATCAC CACGAGCGTT ACCGGTAACG GCGCCATCAC CGAGGCCAAC
TATCTCTCGC TCACCTCGAT TCCGGTCGGT TCCGCCCGGA CCCTTCTGGT GGCTCCCAAC
AGCGGCTACG AAATCAGCAC CTTGACCGCC AGCGGCTCCC CTTCCATCAC TATCCAGGGG
GACGGCACCC GGCTGGTCAC CTACAGCAAC CTGCAGGCGA ACCAGAGCGT CACTGCCGGC
TTCTCGCTGA TCCCGATCGT GGTCGCCAAC GCCGGTAGCG ACGTCACCAC TACGGGCCCA
GGCGCGGCTT ACGCGGTGAC CCTCTTCGGC AGCAACAGCA CAAGCAACCA GGGGGCGATC
AGCTACCAGT GGAGCGGGCC GCCGGCTTTG ATGTTCGGTT CACCCACTTC CGCCGATACC
ACCGTATATT CGGACATCCC CGGGGATTAC ACGGCGACAC TCACCATCAC CTCGAACGGC
ATCACCCGCT CCGATACGGC CATCGTGCAC GTCGTCACCC GTAACTCCTA CCTGGTGTCT
GAGTGCACCA AGTGCCATTC CGGAAACACC ACGGCGCTCG TGGGCCTTTA CAACGGTTCG
CCGCATCTTG AGACCAACGC CTGCCAGGGT TGCCACACCG ACAGCCCGCA CGTGGCGCTG
CCGTCGCCCA ACGTCTGCGC CGACTGCCAT ACGGACACCT CGCGCCATCC CTTCGAGATC
ACCGGCACCT GCACCTCCTG CCACAACTCG CACTCGACGT TGGTCGGGAC AGGCTCGGTG
GACAGTCTGC ATTACAACAA CATCACCACC GGCATGTACC CCGCCTCCTT CGTGACCTCG
CGCGCAGCCT GCGCCAACTG CCATAACAAC ACGATTTCCA ACAAGGCGAT CCGGATACAG
TGGCGCGCGG CGCGGCACGC TAACATCACC TCCATGGGGT GGATCGCCAG GGACTTCAAG
ACCCTTAACG GCTGCGTGCG CTGCCACACC ACCACCGGCT TCATCGCCTA TTCCACGGGC
AAGGTAACGG CGGCTTGGGG CGTGGCCGAA GACAAGACCA AGGAAGTACT TACCTGCATC
GGCTGTCACA GCGACAGCGT GTCCGGGGCG GTGCGCAGCA TGGCGCAGGT TGCGCCGTAC
CCCGACAACA GCTTCGTCAC CCCCGACACC GGCAAGTCCA ACGTCTGCCT CCCCTGCCAC
ACGGGCACCA ACAGCGGCGA GAGCATCAAG GCGCTTTTGC AGGCCCAGGC CGACTTCGGC
AACATAGGTT TCGTGAACCC GCACTACAAG GCGGCCACCG GCTCCCTCTA CGGCGTGGTC
GGGTACCACT TCTCGGGGCG CAGCTACACC ACCGAAGCGA CCCACAACCA TCTGGGCATC
AGCGACGGCG GCGGGGCTTG CGTCTCCTGC CACAGAAACA GCATGAACGG CCACACCTTC
CAGGGTGAGG TGACCCCTGC CTGCGCCACC TGCCACGGCA CCAGTCTGGA CGAGGCCTCC
CTGCACGTGG ACCACAACTA TTTCCTGAAC TCCCTCGAGG TTCTGAGGGC GCAGTTGGCC
GCCAAAGGCT ACGCATATTC GCTTACCAGC CGCAGCTTCA GCGCCACCGA CTGGGGCGTC
GGCCAGGCGG GCGCCGACAC CATGGGCGCC GCGTTCAACT ACGCGCTGCT CGTCTCCGAA
CCGGCCGCCT TCGTGCACAA CCCGAAATAC GCCCAGGAGC TGGTGATCGA TTCCATCGAC
TACCTGGACA ACCGCCAGTT CGACGATTCC GTGGCCGGCA CCGTGCAGGC CCTGCTTGAC
TCGGGAGCGA TCAGCCAGGA GGTCGCCGAC AGTTTCGGCA CCTATAAGCA GAAGAACATC
TGCCTCTCCT GCCACGGCGG CGACTCGATC ACCTCGCGCC CCATGGCCAG CAACGGGCAC
CCGACGCACC TGAGCGCCGC CTACGGTCCC GATGACTACC TGCGCACGCA GAGAAGTGTC
TGCGACGCCT GCCACGGCAA CGACTTCGCC CTGCACTCCA ACGGCACCGT CAACGTGCTG
AGCGACGCCT GCGTGAACTG CCACGCCGGT TCGGTTCCCG CCTGGAACTC CACGGCCCGG
ATCGCCTGCG AGGTCTGCCA CTCGGCGAAC CCGGCGAGGC TTCCCAACGG AGTGGCGGCG
CCCTCCAAGG AGAGTTTCGC CACCTCCGGC CACGGGCAGT TCGGCGCCAG CAACCAGTGC
ACCATCTGCC ACAACCCCAA CAGCAGTCAC ATCGCCGGCA GCCTCACCAG CAACAAAAGG
CTGAAGCTGC AAAACGACAA CAACCTCTGC GCCTCCTGCC ACGACAGCGT GGTGGCACGG
CAGATGTCGA CTCACCATGG CCTTGCCTGC GTGCAGTGCC ACGATCCGCA CGGAAACGGC
AACATCAAGA TGGTACGGGG CACGATCGGG ACGCAGAGCA TCACCTACCT GAACTCCTTG
AACAACTTCG TCGACCAGAC TACCAACAAG GGGCTTTGCC AGGTCTGCCA CAGCTCCACC
CGCTACTACC GCGCGGGGAT CAGCGAGACC AGGCACTACA CTACGGGATG CCTCAGCTGC
CACTTCCACA TCAACCCCGA CGGCGCCTTC TTGCCCAGCG GCGGCGCCTG CGACTCCTGC
CACGGCTATC CGCCCGCTCC CAAGAACACG GCCACCAGCT TCGGCAGCTA CGCCAACTGG
TCCGGCGCGC GCTACGAGGA CTACTCCGGA GGCGGCGGCG CCCATCTGGT GGCGGCACAC
GTCTCCCCCT TCGCCAGTCC GGCCGAGGGA TGGACCAACT GCACCGTCTG CCACAACGGC
GGCTACCATG ACATGACCAC ACCGGTGGCG GAACACATCG GCAACGTCAC CGTGATGGTG
GACAACAACC TGCGCTTTGC CGACAGCTTC ACGGTTTACA CCGGCGCGAA GCTGACCAAC
GCCGGGCCGA ACGCCACGGG AAGCTGCTTC AACATCGCCT GCCACATGAG CCCGTCGGAA
AGGTGGAGTA CGGAAAGATA G
 
Protein sequence
MFRKLLMILV LLLAMPAFAN AWYVNAKTSP TSGAGTITPS GNRTYAAGVN SEEFTVTPAP 
GYTLSRVTLD GVAIAPNANG KYVAPYVSTL TWRYMVAVFS AGTVNITTSV TGNGAITEAN
YLSLTSIPVG SARTLLVAPN SGYEISTLTA SGSPSITIQG DGTRLVTYSN LQANQSVTAG
FSLIPIVVAN AGSDVTTTGP GAAYAVTLFG SNSTSNQGAI SYQWSGPPAL MFGSPTSADT
TVYSDIPGDY TATLTITSNG ITRSDTAIVH VVTRNSYLVS ECTKCHSGNT TALVGLYNGS
PHLETNACQG CHTDSPHVAL PSPNVCADCH TDTSRHPFEI TGTCTSCHNS HSTLVGTGSV
DSLHYNNITT GMYPASFVTS RAACANCHNN TISNKAIRIQ WRAARHANIT SMGWIARDFK
TLNGCVRCHT TTGFIAYSTG KVTAAWGVAE DKTKEVLTCI GCHSDSVSGA VRSMAQVAPY
PDNSFVTPDT GKSNVCLPCH TGTNSGESIK ALLQAQADFG NIGFVNPHYK AATGSLYGVV
GYHFSGRSYT TEATHNHLGI SDGGGACVSC HRNSMNGHTF QGEVTPACAT CHGTSLDEAS
LHVDHNYFLN SLEVLRAQLA AKGYAYSLTS RSFSATDWGV GQAGADTMGA AFNYALLVSE
PAAFVHNPKY AQELVIDSID YLDNRQFDDS VAGTVQALLD SGAISQEVAD SFGTYKQKNI
CLSCHGGDSI TSRPMASNGH PTHLSAAYGP DDYLRTQRSV CDACHGNDFA LHSNGTVNVL
SDACVNCHAG SVPAWNSTAR IACEVCHSAN PARLPNGVAA PSKESFATSG HGQFGASNQC
TICHNPNSSH IAGSLTSNKR LKLQNDNNLC ASCHDSVVAR QMSTHHGLAC VQCHDPHGNG
NIKMVRGTIG TQSITYLNSL NNFVDQTTNK GLCQVCHSST RYYRAGISET RHYTTGCLSC
HFHINPDGAF LPSGGACDSC HGYPPAPKNT ATSFGSYANW SGARYEDYSG GGGAHLVAAH
VSPFASPAEG WTNCTVCHNG GYHDMTTPVA EHIGNVTVMV DNNLRFADSF TVYTGAKLTN
AGPNATGSCF NIACHMSPSE RWSTER
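
The TIGRFAM annotation above refers to CXXCH motifs, the pattern commonly associated with heme binding in c-type cytochromes. As a rough illustration, the sketch below scans a short fragment copied from the protein sequence for that pattern. The fragment choice, the function name `find_cxxch`, and the use of a plain regular expression are assumptions made for this example; it is not how TIGRFAM assigns the domain, which relies on profile models rather than simple pattern matching.

```python
import re

# Fragment copied from the protein sequence above (four consecutive
# 10-residue blocks starting at "ECTKCHSGNT"). Positions reported below
# are relative to this fragment only, not to the full protein.
FRAGMENT = "ECTKCHSGNTTALVGLYNGSPHLETNACQGCHTDSPHVAL"

# C-x-x-C-H; the lookahead allows overlapping candidates to be reported too.
CXXCH = re.compile(r"(?=(C..CH))")


def find_cxxch(protein: str) -> list[tuple[int, str]]:
    """Return (1-based position, motif) pairs for each CXXCH match."""
    cleaned = protein.replace(" ", "").replace("\n", "").upper()
    return [(m.start() + 1, m.group(1)) for m in CXXCH.finditer(cleaned)]


hits = find_cxxch(FRAGMENT)
for position, motif in hits:
    print(position, motif)
```

Because the function strips spaces and newlines, the full space-separated protein sequence can be pasted in directly to list every candidate motif.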