Gene Information |
Locus tag | GM21_3108 |
Symbol | |
ID | 8138458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3603447 |
End bp | 3605108 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644870712 |
Product | cytochrome c family protein |
Protein accession | YP_003022894 |
Protein GI | 253701705 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01905] doubled CXXCH domain |
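The coordinate and length fields above are related by simple arithmetic, and a short check makes that explicit. This is a minimal sketch, not part of the database record: the class `GeneRecord` and both helper functions are hypothetical names introduced here, and the sketch assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GeneRecord:
    """Hypothetical container for the numeric fields of the record above."""
    start_bp: int
    end_bp: int
    gene_length_bp: int
    protein_length_aa: int


def expected_gene_length(start_bp: int, end_bp: int) -> int:
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    # Assumption: the CDS is unspliced and ends in exactly one stop codon,
    # which is not translated into an amino acid.
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record for GM21_3108.
record = GeneRecord(
    start_bp=3603447,
    end_bp=3605108,
    gene_length_bp=1662,
    protein_length_aa=553,
)

gene_ok = expected_gene_length(record.start_bp, record.end_bp) == record.gene_length_bp
protein_ok = expected_protein_length(record.gene_length_bp) == record.protein_length_aa
print("gene length consistent:", gene_ok)
print("protein length consistent:", protein_ok)
```

Under these assumptions both checks pass for this record: 3605108 - 3603447 + 1 = 1662 bp, and 1662 / 3 - 1 = 553 aa.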
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 149 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAAAA TGAAGCAAGG GGTCTTGTTG CTGGTCTTTG GGGCGCTCCT CGTCTCAGGG GGGGGGGAGG CTGCCCACGG CGCCAACAGC GTGCTCGCCA CCAAACACAA CCTCTCGGTC TCCGGCCCCG GCGAGATCAA GGCGGTCAGC GAAGAGCGCG TCTGCATCTT CTGCCACGCC CCGCATCACG CCAACCCCTC CACGCCGCTT TGGAGCCGCG ACACCAGCTC ATCGGAGTAC GACCTGTACG ATTCCACCAC GCTCGTGGCG AAGCCCGGGC AGCCCAGCGG CTCGGCGCGC CTTTGCCTTA GCTGCCACGA CGGGACCATC GCGCTCGGGG CCCTTTTCGG CAGCGACCGG AACGTCAACG CCATCAGCAT GGCGGGAGGG TTCGTGAAAC TCCCGGCGGG GAGGAGCAGC AACCTCGCCG GGACGACGGG AAGGGACCTC GCCAACGATC ACCCGATCTC CTTTCCCTAC ACCAACGAGC TGGCGCAGCT AAACGGCCAG TTGAACCTTC CCGGTTCGCT CCCGTCCCAG GTCCGGCTGG AACAGGGGAA CACGCTCCAG TGCACCGCCT GCCACAATCC GCACAAGAAC CCTTACGGCA AGTTCCTGGT GGTCGACAAC TCCCGTTCGC AGCTTTGCAT CTTCTGCCAC AACATCGCCG GATACGCATC CTCGCGGCAC GCGACCACGG CGAGCCTTAC CGCGGGTTGC AACCTCTGCC ATGCCACCCA CAACGCGGGG GGGAAAAAGC GGCTTCTGGG ACATGCCGCC GAGGAGGAGA ACTGCTACCA GTGCCATAGC GACCAGGGGG GGGCGAAGGA CGTGAGGGTC CCCGCCGCCA AGTTCTACAG CCACCCCATG AGCGCGACTA CAGGGGTGCA CGATCCCAAG GAAGATCCGC TCACCGCCGA AAAGCACGTG GAGTGCTCCG ACTGCCACAA CCCGCACAAG GTTGTCGCGA CCGCCGCCTC CGCGCCTGTG GCCTCAGGGG TTAATGCAGG CGTGGGCGGG GTCTCCATCT CGGGGGAGGT GCTCCCGGGC GACGCCACCT ACCAGTACCA GATCTGCTTT CGCTGCCATG CCGAAAACAA TTTCAGCGGG AGCCAGACCG TGATCAGGCA GATCCAGGAC GTCAACACCA GGCTCGACTT CGACCCGGCC AACCCTTCCT ACCACCCCGT GGCGGCGATA GGGAAGGGGA ACAGCGTCCC AAGCCTGCGC ACCAATTACA GCACGGCCAG CATGATCTAC TGCACCGACT GCCACGGCAA CGACGACGCC ACCCAGGCCC GCGGCCCCCA CGGTTCCAAC CTGAAGCACA TCCTCGTCGC ACGCTACGAG AGCGACACCT ACCCGCTCAC CTACAACGAG GAGAGTTACG CCCTTTGCTA CCGCTGCCAC GACCAGCTCG TGCTCCTGGA TCCCTTGAGG TCGTCCTTCG CGCCGCATGC ACGGCACGTG GTCGACAACA GGGTCCCCTG CTCGGTCTGC CACGATCCGC ACGGCGTCTC CGCGACCCGG GGCGCAGGCA CCACCGCCAA CGCCCACCTG ATCAACTTCG ACATCCGCTT CGTCGCCGCC GGGGGGAGCT ACAACTCCGT GGGGAGGAGC TGCACGGTGA GCTGCCACGC CGTGAACCCC CGCCTCTACT GA
|
Protein sequence | MSKMKQGVLL LVFGALLVSG GGEAAHGANS VLATKHNLSV SGPGEIKAVS EERVCIFCHA PHHANPSTPL WSRDTSSSEY DLYDSTTLVA KPGQPSGSAR LCLSCHDGTI ALGALFGSDR NVNAISMAGG FVKLPAGRSS NLAGTTGRDL ANDHPISFPY TNELAQLNGQ LNLPGSLPSQ VRLEQGNTLQ CTACHNPHKN PYGKFLVVDN SRSQLCIFCH NIAGYASSRH ATTASLTAGC NLCHATHNAG GKKRLLGHAA EEENCYQCHS DQGGAKDVRV PAAKFYSHPM SATTGVHDPK EDPLTAEKHV ECSDCHNPHK VVATAASAPV ASGVNAGVGG VSISGEVLPG DATYQYQICF RCHAENNFSG SQTVIRQIQD VNTRLDFDPA NPSYHPVAAI GKGNSVPSLR TNYSTASMIY CTDCHGNDDA TQARGPHGSN LKHILVARYE SDTYPLTYNE ESYALCYRCH DQLVLLDPLR SSFAPHARHV VDNRVPCSVC HDPHGVSATR GAGTTANAHL INFDIRFVAA GGSYNSVGRS CTVSCHAVNP RLY
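The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any analysis. The sketch below shows one way to strip the spacing, compute a GC percentage, and locate CXXCH motifs (the c-type cytochrome heme-binding pattern named in the TIGRFAM annotation). The function names are hypothetical and introduced only for illustration, and only short excerpts copied from the record are used as input; paste the full sequences to analyze the whole gene.

```python
import re


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used in the record; uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# A lookahead is used so that overlapping motifs, if any, are all reported.
CXXCH = re.compile(r"(?=(C..CH))")


def find_cxxch(protein: str) -> list[tuple[int, str]]:
    """Return (0-based start index, motif) for every CXXCH match."""
    seq = clean_sequence(protein)
    return [(m.start(), m.group(1)) for m in CXXCH.finditer(seq)]


# Excerpts copied from the record: the first 20 bases of the gene
# and the first 60 residues of the protein.
first_bases = "ATGTCGAAAA TGAAGCAAGG"
n_terminal = (
    "MSKMKQGVLL LVFGALLVSG GGEAAHGANS "
    "VLATKHNLSV SGPGEIKAVS EERVCIFCHA"
)

print(gc_percent(first_bases))
print(find_cxxch(n_terminal))
```

Note that the GC value of a short excerpt will generally differ from the 65% reported for the whole gene; the figure in the record refers to the full 1662 bp sequence.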