Gene TM1040_1670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1670 
Symbol 
ID4075773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1769678 
End bp1770901 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content59% 
IMG OID638006983 
ProductPhage portal protein, HK97 
Protein accessionYP_613665 
Protein GI99081511 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCTGC GAGATCGCTT GAAATCCATG ATCATTCGCC GCTTTGGGCT GACTGATGCT 
CACCAGATGG GACTTCATCG TGCCAGTGAT GCGGGGGAGA TCGTCACCGG GCATAGCGCG
CTCGGAATTT CGACGGTTTG GGCCTGCACC AACTTGATTG CGGGAACAAT CGGCTCGCTT
CCGTTGATGG TGTATCGCCG GAACGGCCAA ACGCGCACCG TTGACCGCTC GCACGTCGTT
TACCGACTGC TGCATGACAG CCCGAACTAT GATCAGACAG CGCTGGACTT CTGGGAATTT
ATGGCGGCGT CTCTGGAGCT GTGGGGCAAC GCCTACGCGC ATATCTTGCG GGAAAATGGC
AAGATCGTCG GCCTAGTGCC TGTCGCGCCA GATCTGATGA GGGTTCGCCG TCTGCCGACC
GGTGAAATCG AATATCGTTG GTCGGAAGAT GGCAAAACCC ATCGGGAGCT TGACGGCGCT
GTGCTGCACG TTCGCGGTTT TGGAGGTTCG CCGCTTGGCG GCATGTCTAC CCTACAATTC
GCTCGCAATG CTTTCGGCTT GGCACGTGCG GTTGACCGAG CAGCGGGTGA GACGTTCAAG
AACGGGATGC GCCCCTCTGG CGCGTTGAAA TTTGACAATT GGCTGACTGA TGAGCAGCGG
GCCCGGGCCA AATCCACTTT GGTTGACGAC ATGGTGGGGG CGCAGAACTC CGGGCGCCCA
ATTGTTCTGG AGGGTGGCAC CAATTGGGTG CCATTCACGA TCAACCCCGA TGATGCGCAA
ATGCTGGAAA GCCGACGCTT CTCGGTGGAG GAAATCTGCC GGTTTTTCGG CGTGCCGCCG
CATATGGTTG GGCACACAGA GAAAAGCACG AGTTGGGGAA CTGGCTTGGA GCAGCAAACC
CTTGCGTTTC AGAAGTTCAC CCTTCGCCGC CGCATCAAGC GGATTGAGCA GGCGCTGATG
AAACAGCTCC TGACCCCTGC TGAACGGGCG CGCGGGCTGA TGATCGAATT CAACCTGGAA
GGGCTGCTTC GCGGAGACAG CAAGTCGCGC GCCGATTTCT ACCAGTCTGG CCTGCAGAAC
GGCTGGCTGA CCATCAATGA GGTGCGCGCG CTGGAGAACA AGCCCCCGGT GGCGGGCGGC
GAGGTGCCGC GAATGCAGAT GCAGAACGTG CCGATCACCG AGGTAGGCAA GCAATTGGAG
GCTGGAAATG ACGATGATGC ATAA
 
Protein sequence
MGLRDRLKSM IIRRFGLTDA HQMGLHRASD AGEIVTGHSA LGISTVWACT NLIAGTIGSL 
PLMVYRRNGQ TRTVDRSHVV YRLLHDSPNY DQTALDFWEF MAASLELWGN AYAHILRENG
KIVGLVPVAP DLMRVRRLPT GEIEYRWSED GKTHRELDGA VLHVRGFGGS PLGGMSTLQF
ARNAFGLARA VDRAAGETFK NGMRPSGALK FDNWLTDEQR ARAKSTLVDD MVGAQNSGRP
IVLEGGTNWV PFTINPDDAQ MLESRRFSVE EICRFFGVPP HMVGHTEKST SWGTGLEQQT
LAFQKFTLRR RIKRIEQALM KQLLTPAERA RGLMIEFNLE GLLRGDSKSR ADFYQSGLQN
GWLTINEVRA LENKPPVAGG EVPRMQMQNV PITEVGKQLE AGNDDDA