Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3658 |
Symbol | |
ID | 8139032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4233785 |
End bp | 4235986 |
Gene Length | 2202 bp |
Protein Length | 733 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644871279 |
Product | phage tail tape measure protein, TP901 family |
Protein accession | YP_003023437 |
Protein GI | 253702248 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 159 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATCTC TCTTCAAATT GGGCATATTG CTGCGGATCA CGGATATGGT TTCGGGTCCG GTGGCGAAGA TCTCGGGAAC GATCGACACC CTGCACGCTA AGGCTATGAA GCTGCAGCCG GTGTTCGACA AGTTCCGGGA CTATGGCCAA TGGATAGCCG GCGCCGGGAT CGCGGGCGCG CTGGGCCTCG GGATAGCGGT GACGTCGTTC GCCAATCTGG AGGAGGCGCA GTTGGGACTC CGGACGCTGC TCATGGACTC AACCGGTCAG GTGGGGGCGG AGTACTCCAG GCTGAACTCG TTGGCCGAGG ATCTGGGGAC CTCGCTGCCA GGCTCGACCA GGGACATGAT CGGTATGTTC ACCGCGCTGC GCGAGCAGGG GGTGCAGACG AACGTGATCT TGGGGGGCAT GGGGGAGGCC GCGGCGAAAT TCGCGGTGCT GATGAAGGTC TCCTTTGCCG AGGGGGCAAC GCACGTCGCC AAGTTTTCCG AAGCCCTGGG GATCGCCGAC AAGGAGGCGG TCCCGTTCAT GGATACCCTG CAGCGGCTGA AGGGTGCTGC AGGGGTTAAC GTTGCCGATT TGTCGGAGAG CCTGAAGTAC GCCGGCTCCT CGCTCAAGGC GCTGCGCATC CAGGGGCTGG AGGCCGGAAG GGACGTAAGC GCCGCGATCG GGCTCATGGC CACCTCGAGC ATCGAGGGGA GCCAGGCCGG TACCAACTTT GCCCAGGCGT TGACGAGGAT GGCGGAGATC AGCAGCAAGC TGGACTCCGG CAGGATCGCG AAGATGGTCG GCCCCATCCT GGACGCCAAG GGAATCAAGC TCAATTTCTT TGACGAGGCA GGCAACTTCG TCGGGATCCG GAACATGATG GGGGAGCTGG AAAAGCTTCG CGCCATCAAC CCCCAGGAAC AGTTGATCGT TCTTTCCAAG CTTTTCGGCG CCGAGGCGTC GCGGCCTCTG TCGGTATTCA TCAACCAGGG CGTGGCCGGC TTCGACGCCA TGCTTGAGAA GATGCGGAAC CAGGCGGACA TGCAGACCAA GATCAACGAG ATCATGAGCG GCACCAAGAT GCAGTGGGAG ACGCTCACCG GCACCTTGGG CAATGTGGTG GCCCACATCG GCGCCGTGGT GACCAAGTCG GCCGGACTGA ATGCTGTGAT GAGGGTGGCC AACGACCTGG CTGGCCGCAT GGACTCCTGG ATCATAGCTC ACCCCCGCAC GGCTGGGATC ATCGGCGGCT TGGCCGTCGC AGTTACCGGC GCAGCGCTTG CTATCGGCGG GCTACTCCTG GTCATAGGGC TAGGCGGCAC GCTGGCGACC AAGATGATGG TCGGCTATGG ATTGCTGGTT CAGGGCGTAA TGCTGTTGAA GCTGGCCCTG GGCGCGCTTA TCCCGGTGGT TTGGAGTTTT ACCGCGGCGC TGTTGGCCAA CCCTATGACC TGGATAGTGC TGGCGATCGT CGCAGGAGTA GCAGTCGTTG GCGGGGCCAT CGTCTGGATG TACCGCCGGG TCGAGTGGTT TAAGACCATG ATGGACGGCT TCTTGTTCTT CCTTGGGTTC AGCATCGGGT TGATCGCTAA GGGGTGGAAG AATCTTGCAG GCTGGGTGTC TCTGCCTTTC TCCGCCATCT GGTCAGTGAT CGCCAGATTG ATCGCCGCCC TGCCGAAGAT CAGCAGCGCT TTTTCAAATG CCATGGCGGG TATGCTGAAC GCCCTACCGG CCATCCTGGG CGGGCTGTTC AGATCGGGGC AAAAGATCGT ATCGACCATG GTAGACGGCA TCAGGTCAAT GGCGGGCGCT GTCCCGGGAG CGATCAAGGA TATTTTCGGC AAGGTTAGGA ACCTGCTCCC TTTCTCCGAC GCCAAGGAAG GGCCGCTTTC GCAACTGACG CTGTCGGGAT CCCGGATCAT GTCCACCCTG GGCGACGGGA TTACCGGGGC TGCTCCCGGG CTCCATAAGA CCATGGCGAC AGCGCTGGCC GGTGCTGCGC TGACCACGAG CATCGCGGTG GCTCCTCCCC CCTCCTTCGC GGCCGATGCA GTCGGAAAGG CGGCGGTTTC CGCAGGCAAG TCTGCAGGAG CCGACTCCGG CGGCAAAAAG CTCGTGATCC ACATCGAAAA GCTCGAGCTG CCCGGAGTGA GCAACGCCGC CAACTTCGTG GCGCAGCTGC AGGCGCTGGT GGAGGCATAC GATGGCGAGT GA
|
Protein sequence | MESLFKLGIL LRITDMVSGP VAKISGTIDT LHAKAMKLQP VFDKFRDYGQ WIAGAGIAGA LGLGIAVTSF ANLEEAQLGL RTLLMDSTGQ VGAEYSRLNS LAEDLGTSLP GSTRDMIGMF TALREQGVQT NVILGGMGEA AAKFAVLMKV SFAEGATHVA KFSEALGIAD KEAVPFMDTL QRLKGAAGVN VADLSESLKY AGSSLKALRI QGLEAGRDVS AAIGLMATSS IEGSQAGTNF AQALTRMAEI SSKLDSGRIA KMVGPILDAK GIKLNFFDEA GNFVGIRNMM GELEKLRAIN PQEQLIVLSK LFGAEASRPL SVFINQGVAG FDAMLEKMRN QADMQTKINE IMSGTKMQWE TLTGTLGNVV AHIGAVVTKS AGLNAVMRVA NDLAGRMDSW IIAHPRTAGI IGGLAVAVTG AALAIGGLLL VIGLGGTLAT KMMVGYGLLV QGVMLLKLAL GALIPVVWSF TAALLANPMT WIVLAIVAGV AVVGGAIVWM YRRVEWFKTM MDGFLFFLGF SIGLIAKGWK NLAGWVSLPF SAIWSVIARL IAALPKISSA FSNAMAGMLN ALPAILGGLF RSGQKIVSTM VDGIRSMAGA VPGAIKDIFG KVRNLLPFSD AKEGPLSQLT LSGSRIMSTL GDGITGAAPG LHKTMATALA GAALTTSIAV APPPSFAADA VGKAAVSAGK SAGADSGGKK LVIHIEKLEL PGVSNAANFV AQLQALVEAY DGE
|
| |