Gene Information |
Locus tag | GM21_3544 |
Symbol | |
ID | 8138916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4102550 |
End bp | 4105870 |
Gene Length | 3321 bp |
Protein Length | 1106 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644871163 |
Product | cytochrome C family protein |
Protein accession | YP_003023323 |
Protein GI | 253702134 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01905] doubled CXXCH domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 143 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTAGAA AATTGCTCAT GATACTTGTG TTGCTGCTGG CTATGCCGGC CTTTGCCAAC GCATGGTATG TCAATGCGAA GACTTCGCCG ACCAGCGGTG CGGGCACCAT CACCCCGTCG GGCAACAGGA CCTATGCCGC GGGGGTGAAC AGCGAGGAGT TCACCGTCAC TCCGGCTCCG GGCTACACCC TTTCCCGCGT CACGCTCGAC GGCGTCGCCA TCGCCCCCAA CGCAAACGGG AAGTACGTCG CACCTTACGT CTCGACCCTG ACCTGGCGCT ACATGGTGGC AGTCTTCTCG GCCGGCACGG TGAACATCAC CACGAGCGTT ACCGGTAACG GCGCCATCAC CGAGGCCAAC TATCTCTCGC TCACCTCGAT TCCGGTCGGT TCCGCCCGGA CCCTTCTGGT GGCTCCCAAC AGCGGCTACG AAATCAGCAC CTTGACCGCC AGCGGCTCCC CTTCCATCAC TATCCAGGGG GACGGCACCC GGCTGGTCAC CTACAGCAAC CTGCAGGCGA ACCAGAGCGT CACTGCCGGC TTCTCGCTGA TCCCGATCGT GGTCGCCAAC GCCGGTAGCG ACGTCACCAC TACGGGCCCA GGCGCGGCTT ACGCGGTGAC CCTCTTCGGC AGCAACAGCA CAAGCAACCA GGGGGCGATC AGCTACCAGT GGAGCGGGCC GCCGGCTTTG ATGTTCGGTT CACCCACTTC CGCCGATACC ACCGTATATT CGGACATCCC CGGGGATTAC ACGGCGACAC TCACCATCAC CTCGAACGGC ATCACCCGCT CCGATACGGC CATCGTGCAC GTCGTCACCC GTAACTCCTA CCTGGTGTCT GAGTGCACCA AGTGCCATTC CGGAAACACC ACGGCGCTCG TGGGCCTTTA CAACGGTTCG CCGCATCTTG AGACCAACGC CTGCCAGGGT TGCCACACCG ACAGCCCGCA CGTGGCGCTG CCGTCGCCCA ACGTCTGCGC CGACTGCCAT ACGGACACCT CGCGCCATCC CTTCGAGATC ACCGGCACCT GCACCTCCTG CCACAACTCG CACTCGACGT TGGTCGGGAC AGGCTCGGTG GACAGTCTGC ATTACAACAA CATCACCACC GGCATGTACC CCGCCTCCTT CGTGACCTCG CGCGCAGCCT GCGCCAACTG CCATAACAAC ACGATTTCCA ACAAGGCGAT CCGGATACAG TGGCGCGCGG CGCGGCACGC TAACATCACC TCCATGGGGT GGATCGCCAG GGACTTCAAG ACCCTTAACG GCTGCGTGCG CTGCCACACC ACCACCGGCT TCATCGCCTA TTCCACGGGC AAGGTAACGG CGGCTTGGGG CGTGGCCGAA GACAAGACCA AGGAAGTACT TACCTGCATC GGCTGTCACA GCGACAGCGT GTCCGGGGCG GTGCGCAGCA TGGCGCAGGT TGCGCCGTAC CCCGACAACA GCTTCGTCAC CCCCGACACC GGCAAGTCCA ACGTCTGCCT CCCCTGCCAC ACGGGCACCA ACAGCGGCGA GAGCATCAAG GCGCTTTTGC AGGCCCAGGC CGACTTCGGC AACATAGGTT TCGTGAACCC GCACTACAAG GCGGCCACCG GCTCCCTCTA CGGCGTGGTC GGGTACCACT TCTCGGGGCG CAGCTACACC ACCGAAGCGA CCCACAACCA TCTGGGCATC AGCGACGGCG GCGGGGCTTG CGTCTCCTGC CACAGAAACA GCATGAACGG CCACACCTTC CAGGGTGAGG TGACCCCTGC CTGCGCCACC TGCCACGGCA CCAGTCTGGA CGAGGCCTCC 
CTGCACGTGG ACCACAACTA TTTCCTGAAC TCCCTCGAGG TTCTGAGGGC GCAGTTGGCC GCCAAAGGCT ACGCATATTC GCTTACCAGC CGCAGCTTCA GCGCCACCGA CTGGGGCGTC GGCCAGGCGG GCGCCGACAC CATGGGCGCC GCGTTCAACT ACGCGCTGCT CGTCTCCGAA CCGGCCGCCT TCGTGCACAA CCCGAAATAC GCCCAGGAGC TGGTGATCGA TTCCATCGAC TACCTGGACA ACCGCCAGTT CGACGATTCC GTGGCCGGCA CCGTGCAGGC CCTGCTTGAC TCGGGAGCGA TCAGCCAGGA GGTCGCCGAC AGTTTCGGCA CCTATAAGCA GAAGAACATC TGCCTCTCCT GCCACGGCGG CGACTCGATC ACCTCGCGCC CCATGGCCAG CAACGGGCAC CCGACGCACC TGAGCGCCGC CTACGGTCCC GATGACTACC TGCGCACGCA GAGAAGTGTC TGCGACGCCT GCCACGGCAA CGACTTCGCC CTGCACTCCA ACGGCACCGT CAACGTGCTG AGCGACGCCT GCGTGAACTG CCACGCCGGT TCGGTTCCCG CCTGGAACTC CACGGCCCGG ATCGCCTGCG AGGTCTGCCA CTCGGCGAAC CCGGCGAGGC TTCCCAACGG AGTGGCGGCG CCCTCCAAGG AGAGTTTCGC CACCTCCGGC CACGGGCAGT TCGGCGCCAG CAACCAGTGC ACCATCTGCC ACAACCCCAA CAGCAGTCAC ATCGCCGGCA GCCTCACCAG CAACAAAAGG CTGAAGCTGC AAAACGACAA CAACCTCTGC GCCTCCTGCC ACGACAGCGT GGTGGCACGG CAGATGTCGA CTCACCATGG CCTTGCCTGC GTGCAGTGCC ACGATCCGCA CGGAAACGGC AACATCAAGA TGGTACGGGG CACGATCGGG ACGCAGAGCA TCACCTACCT GAACTCCTTG AACAACTTCG TCGACCAGAC TACCAACAAG GGGCTTTGCC AGGTCTGCCA CAGCTCCACC CGCTACTACC GCGCGGGGAT CAGCGAGACC AGGCACTACA CTACGGGATG CCTCAGCTGC CACTTCCACA TCAACCCCGA CGGCGCCTTC TTGCCCAGCG GCGGCGCCTG CGACTCCTGC CACGGCTATC CGCCCGCTCC CAAGAACACG GCCACCAGCT TCGGCAGCTA CGCCAACTGG TCCGGCGCGC GCTACGAGGA CTACTCCGGA GGCGGCGGCG CCCATCTGGT GGCGGCACAC GTCTCCCCCT TCGCCAGTCC GGCCGAGGGA TGGACCAACT GCACCGTCTG CCACAACGGC GGCTACCATG ACATGACCAC ACCGGTGGCG GAACACATCG GCAACGTCAC CGTGATGGTG GACAACAACC TGCGCTTTGC CGACAGCTTC ACGGTTTACA CCGGCGCGAA GCTGACCAAC GCCGGGCCGA ACGCCACGGG AAGCTGCTTC AACATCGCCT GCCACATGAG CCCGTCGGAA AGGTGGAGTA CGGAAAGATA G
|
Protein sequence | MFRKLLMILV LLLAMPAFAN AWYVNAKTSP TSGAGTITPS GNRTYAAGVN SEEFTVTPAP GYTLSRVTLD GVAIAPNANG KYVAPYVSTL TWRYMVAVFS AGTVNITTSV TGNGAITEAN YLSLTSIPVG SARTLLVAPN SGYEISTLTA SGSPSITIQG DGTRLVTYSN LQANQSVTAG FSLIPIVVAN AGSDVTTTGP GAAYAVTLFG SNSTSNQGAI SYQWSGPPAL MFGSPTSADT TVYSDIPGDY TATLTITSNG ITRSDTAIVH VVTRNSYLVS ECTKCHSGNT TALVGLYNGS PHLETNACQG CHTDSPHVAL PSPNVCADCH TDTSRHPFEI TGTCTSCHNS HSTLVGTGSV DSLHYNNITT GMYPASFVTS RAACANCHNN TISNKAIRIQ WRAARHANIT SMGWIARDFK TLNGCVRCHT TTGFIAYSTG KVTAAWGVAE DKTKEVLTCI GCHSDSVSGA VRSMAQVAPY PDNSFVTPDT GKSNVCLPCH TGTNSGESIK ALLQAQADFG NIGFVNPHYK AATGSLYGVV GYHFSGRSYT TEATHNHLGI SDGGGACVSC HRNSMNGHTF QGEVTPACAT CHGTSLDEAS LHVDHNYFLN SLEVLRAQLA AKGYAYSLTS RSFSATDWGV GQAGADTMGA AFNYALLVSE PAAFVHNPKY AQELVIDSID YLDNRQFDDS VAGTVQALLD SGAISQEVAD SFGTYKQKNI CLSCHGGDSI TSRPMASNGH PTHLSAAYGP DDYLRTQRSV CDACHGNDFA LHSNGTVNVL SDACVNCHAG SVPAWNSTAR IACEVCHSAN PARLPNGVAA PSKESFATSG HGQFGASNQC TICHNPNSSH IAGSLTSNKR LKLQNDNNLC ASCHDSVVAR QMSTHHGLAC VQCHDPHGNG NIKMVRGTIG TQSITYLNSL NNFVDQTTNK GLCQVCHSST RYYRAGISET RHYTTGCLSC HFHINPDGAF LPSGGACDSC HGYPPAPKNT ATSFGSYANW SGARYEDYSG GGGAHLVAAH VSPFASPAEG WTNCTVCHNG GYHDMTTPVA EHIGNVTVMV DNNLRFADSF TVYTGAKLTN AGPNATGSCF NIACHMSPSE RWSTER
|
| |
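The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the record itself does not state either convention. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values copied from the record above (GM21_3544).
start_bp, end_bp = 4102550, 4105870

length = gene_length(start_bp, end_bp)
print(length)                  # 3321, matches "Gene Length"
print(protein_length(length))  # 1106, matches "Protein Length"
```

Under these assumptions both computed values agree with the listed 3321 bp and 1106 aa. The strand field does not affect either calculation.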