Gene GM21_3658 details


Gene Information

Locus tag: GM21_3658
Symbol:
ID: 8139032
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4233785
End bp: 4235986
Gene Length: 2202 bp
Protein Length: 733 aa
Translation table: 11
GC content: 62%
IMG OID: 644871279
Product: phage tail tape measure protein, TP901 family
Protein accession: YP_003023437
Protein GI: 253702248
COG category:
COG ID:
TIGRFAM ID: [TIGR01760] phage tail tape measure protein, TP901 family, core region
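
The coordinates and lengths in this record are mutually consistent, which can be checked with a short Python sketch. It assumes the start and end positions are 1-based and inclusive and that the stop codon is not counted in the protein length. The function names are illustrative and are not part of any database tool.

```python
# Values copied from the Gene Information record.
START_BP = 4233785
END_BP = 4235986


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 2202, matching "Gene Length"
print(protein_length(length))  # 733, matching "Protein Length"
```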


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 159
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATCTC TCTTCAAATT GGGCATATTG CTGCGGATCA CGGATATGGT TTCGGGTCCG 
GTGGCGAAGA TCTCGGGAAC GATCGACACC CTGCACGCTA AGGCTATGAA GCTGCAGCCG
GTGTTCGACA AGTTCCGGGA CTATGGCCAA TGGATAGCCG GCGCCGGGAT CGCGGGCGCG
CTGGGCCTCG GGATAGCGGT GACGTCGTTC GCCAATCTGG AGGAGGCGCA GTTGGGACTC
CGGACGCTGC TCATGGACTC AACCGGTCAG GTGGGGGCGG AGTACTCCAG GCTGAACTCG
TTGGCCGAGG ATCTGGGGAC CTCGCTGCCA GGCTCGACCA GGGACATGAT CGGTATGTTC
ACCGCGCTGC GCGAGCAGGG GGTGCAGACG AACGTGATCT TGGGGGGCAT GGGGGAGGCC
GCGGCGAAAT TCGCGGTGCT GATGAAGGTC TCCTTTGCCG AGGGGGCAAC GCACGTCGCC
AAGTTTTCCG AAGCCCTGGG GATCGCCGAC AAGGAGGCGG TCCCGTTCAT GGATACCCTG
CAGCGGCTGA AGGGTGCTGC AGGGGTTAAC GTTGCCGATT TGTCGGAGAG CCTGAAGTAC
GCCGGCTCCT CGCTCAAGGC GCTGCGCATC CAGGGGCTGG AGGCCGGAAG GGACGTAAGC
GCCGCGATCG GGCTCATGGC CACCTCGAGC ATCGAGGGGA GCCAGGCCGG TACCAACTTT
GCCCAGGCGT TGACGAGGAT GGCGGAGATC AGCAGCAAGC TGGACTCCGG CAGGATCGCG
AAGATGGTCG GCCCCATCCT GGACGCCAAG GGAATCAAGC TCAATTTCTT TGACGAGGCA
GGCAACTTCG TCGGGATCCG GAACATGATG GGGGAGCTGG AAAAGCTTCG CGCCATCAAC
CCCCAGGAAC AGTTGATCGT TCTTTCCAAG CTTTTCGGCG CCGAGGCGTC GCGGCCTCTG
TCGGTATTCA TCAACCAGGG CGTGGCCGGC TTCGACGCCA TGCTTGAGAA GATGCGGAAC
CAGGCGGACA TGCAGACCAA GATCAACGAG ATCATGAGCG GCACCAAGAT GCAGTGGGAG
ACGCTCACCG GCACCTTGGG CAATGTGGTG GCCCACATCG GCGCCGTGGT GACCAAGTCG
GCCGGACTGA ATGCTGTGAT GAGGGTGGCC AACGACCTGG CTGGCCGCAT GGACTCCTGG
ATCATAGCTC ACCCCCGCAC GGCTGGGATC ATCGGCGGCT TGGCCGTCGC AGTTACCGGC
GCAGCGCTTG CTATCGGCGG GCTACTCCTG GTCATAGGGC TAGGCGGCAC GCTGGCGACC
AAGATGATGG TCGGCTATGG ATTGCTGGTT CAGGGCGTAA TGCTGTTGAA GCTGGCCCTG
GGCGCGCTTA TCCCGGTGGT TTGGAGTTTT ACCGCGGCGC TGTTGGCCAA CCCTATGACC
TGGATAGTGC TGGCGATCGT CGCAGGAGTA GCAGTCGTTG GCGGGGCCAT CGTCTGGATG
TACCGCCGGG TCGAGTGGTT TAAGACCATG ATGGACGGCT TCTTGTTCTT CCTTGGGTTC
AGCATCGGGT TGATCGCTAA GGGGTGGAAG AATCTTGCAG GCTGGGTGTC TCTGCCTTTC
TCCGCCATCT GGTCAGTGAT CGCCAGATTG ATCGCCGCCC TGCCGAAGAT CAGCAGCGCT
TTTTCAAATG CCATGGCGGG TATGCTGAAC GCCCTACCGG CCATCCTGGG CGGGCTGTTC
AGATCGGGGC AAAAGATCGT ATCGACCATG GTAGACGGCA TCAGGTCAAT GGCGGGCGCT
GTCCCGGGAG CGATCAAGGA TATTTTCGGC AAGGTTAGGA ACCTGCTCCC TTTCTCCGAC
GCCAAGGAAG GGCCGCTTTC GCAACTGACG CTGTCGGGAT CCCGGATCAT GTCCACCCTG
GGCGACGGGA TTACCGGGGC TGCTCCCGGG CTCCATAAGA CCATGGCGAC AGCGCTGGCC
GGTGCTGCGC TGACCACGAG CATCGCGGTG GCTCCTCCCC CCTCCTTCGC GGCCGATGCA
GTCGGAAAGG CGGCGGTTTC CGCAGGCAAG TCTGCAGGAG CCGACTCCGG CGGCAAAAAG
CTCGTGATCC ACATCGAAAA GCTCGAGCTG CCCGGAGTGA GCAACGCCGC CAACTTCGTG
GCGCAGCTGC AGGCGCTGGT GGAGGCATAC GATGGCGAGT GA
 
Protein sequence
MESLFKLGIL LRITDMVSGP VAKISGTIDT LHAKAMKLQP VFDKFRDYGQ WIAGAGIAGA 
LGLGIAVTSF ANLEEAQLGL RTLLMDSTGQ VGAEYSRLNS LAEDLGTSLP GSTRDMIGMF
TALREQGVQT NVILGGMGEA AAKFAVLMKV SFAEGATHVA KFSEALGIAD KEAVPFMDTL
QRLKGAAGVN VADLSESLKY AGSSLKALRI QGLEAGRDVS AAIGLMATSS IEGSQAGTNF
AQALTRMAEI SSKLDSGRIA KMVGPILDAK GIKLNFFDEA GNFVGIRNMM GELEKLRAIN
PQEQLIVLSK LFGAEASRPL SVFINQGVAG FDAMLEKMRN QADMQTKINE IMSGTKMQWE
TLTGTLGNVV AHIGAVVTKS AGLNAVMRVA NDLAGRMDSW IIAHPRTAGI IGGLAVAVTG
AALAIGGLLL VIGLGGTLAT KMMVGYGLLV QGVMLLKLAL GALIPVVWSF TAALLANPMT
WIVLAIVAGV AVVGGAIVWM YRRVEWFKTM MDGFLFFLGF SIGLIAKGWK NLAGWVSLPF
SAIWSVIARL IAALPKISSA FSNAMAGMLN ALPAILGGLF RSGQKIVSTM VDGIRSMAGA
VPGAIKDIFG KVRNLLPFSD AKEGPLSQLT LSGSRIMSTL GDGITGAAPG LHKTMATALA
GAALTTSIAV APPPSFAADA VGKAAVSAGK SAGADSGGKK LVIHIEKLEL PGVSNAANFV
AQLQALVEAY DGE
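
The sequences above are printed in space-separated groups of ten residues, so the grouping has to be removed before any analysis. The following is a minimal Python sketch of that cleanup together with a GC-content calculation. The helper names are hypothetical, and the example uses only the first line of the gene sequence, so its GC value is not expected to equal the 62% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that group residues in tens."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence, copied as displayed in the record.
first_line = (
    "ATGGAATCTC TCTTCAAATT GGGCATATTG "
    "CTGCGGATCA CGGATATGGT TTCGGGTCCG"
)

seq = clean_sequence(first_line)
print(len(seq))         # 60 bases on one display line
print(gc_percent(seq))  # GC percentage of this fragment only
```

Running the same two functions over the full gene sequence would give a value to compare against the reported GC content.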