Gene TM1040_1302 details


Gene Information

Locus tag: TM1040_1302
Symbol:
ID: 4078501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1394279
End bp: 1396231
Gene Length: 1953 bp
Protein Length: 650 aa
Translation table: 11
GC content: 62%
IMG OID: 638006610
Product: phage terminase GpA
Protein accession: YP_613297
Protein GI: 99081143
COG category: [R] General function prediction only
COG ID: [COG5525] Bacteriophage tail assembly protein
TIGRFAM ID:
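
The start, end, gene length and protein length fields above agree with each other arithmetically. The sketch below shows that relationship. It assumes the coordinates are 1-based and inclusive and that the CDS includes its stop codon; neither is stated in the record, but both match the listed numbers. The function name `feature_lengths` is hypothetical.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends with a stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # the final codon is the stop codon
    return gene_len, protein_len


print(feature_lengths(1394279, 1396231))  # (1953, 650)
```

The result matches the Gene Length (1953 bp) and Protein Length (650 aa) fields of this record.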


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.656027
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACGCCC GGGCTGATTT CTCGAATGCG CGGGCTGTTG TCCGCATTGC TCGGCGCGCC 
CGGGCATTTC TGCGCCCGCC GCCGAACCTA AAGCCGTCGG AGTGGGCCGA GGAAAATATC
AAGATCCCCG TCGGAAACGC TGTGCCGGGC AAAATGCGGT TCGACAACGC GCCCTATCAG
CGCGAAGTGA TCGATATGAC CGCAGATCCG CGGTGCAACC GGATCTCGCT CATGTGGGGC
GCGCAGGTCG GCAAGACGCA GACCGCGCTT GCGGCGCAGG CCTATCGCAT CGGCTTCAAT
CCTGTGTCTC AGATGATGAT GCAGCCGAGC CAAGGCGATT TGACGACGTG GCTCGAGACC
AAATTCAACC CGCTTGTTGA AGAGAACGAA GACCTCGCCG AGCTGATCGC GAAGCCGCGC
GGGCGCCAAG GCGTCAATAA TCAGCGCATG AAGAGTTACC CGGGCGGGTT TCTCATGTTC
AGTTGGTCGG GTTCTCCGAA GACCATGCGC GGACGTTCGG CGCCGTTCAT TGTTTGCGAT
GAAACCGACG GCTACGACCG GACAAACGAA GGCCACCCGG TCGGCCTGCT GTGGCAACGG
GCGGCGACCT TTGGTGATCA GCGGCTCTTG TTGGAAATCA GCACGCCGAC GATCAAGGGC
GGCAGCTGGA TTGAAAAGTC CTTTGAGGCA GGAGACCAGC GCTATTTTTA CGTGCGGTGT
CCGCATTGCG GCCACCTGCA ACGGCTGAAC TGGTCGCAGG TGACCTGGTC CAAAGACGCC
GACGGGCTGC ACCTCGCCGA AACAGCAGGC TATCTGTGCG TGGGTGAGGG ATGCGGAACC
GTTTGGAGCG ATGGCGAGCG AGTGGCAGCG ATCCGCAACG CCGAACGCGA CGGCGGCGGC
TGGATCGCCA GCAAGCCGTT CCGAGGTCAC GCGTCCTATC ATTTGTCGGA GCTGTATTCC
TGCTTTCGGC GGCTCGAGGA TATCGTGCAG TCCTTCCTCG ACAAAAAGGC CGCGGGGGAT
TTGCAAACCT TTGTGAACGT GTCGCTCGCT GAGACATGGG AAGAGGAAGG CGACAAGCTC
GAGGCGTCCG TGCTGATGGC ACGGGCTGCT AAGTTCGCAG CACCGGTCCC GTTGGGGGCA
GGCGTCCTGA CGGCTGGTAT TGATATGCAA AACGACCGCC TCGAGGTCGA AATAGTTGGC
TGGGGCTTGG GGGAGGAATC CTGGTCTGTC GATTATCGGG TTTTGTGGGG CGATCCGCTG
CAACAGGACG TCTGGGACGA ACTAGACGCC TTGCTGTCGG AAACATGGGA GCACGAGAGC
GGAGCGGAGT TGCGGGTCTC TGCCGCCTGC CTCGATACCG GGGGTGAAGG TGGGCGCACG
CAAGCAGCCT ATGACTATGC CCGCAAGCGT TTGGGCCGCA AGGTCTGGGC GATCAAGGGC
GTCGGTGGCT GGGGCAGACC CATCGTTACG CAGCCCTCGA AGGTCAAACA AAAGGGCGTG
CGCCCCGTCT ACTTGCACTC CATCGGCGTC GATGAGGCGA AAGCCGTAGT CGCCCAGCGG
GCGCGGATTA GCGACGCGGG GCCGGGCCAT TGCCATTTCC CGGCAGATCG GGATCCGGCG
TGGTTCGATA TGTTCACCGC CGAGGCGCTG CGCACCCGGT ATGTGAAGGG GTTCGCAGTC
CGAGAATGGC ACAATGTACG CCCACGCAAC GAAGCATTTG ACTGCCGCGT CTATGCCTAC
GCAGCGCTGC GCATCCTGCG CCCGAATATC AAACGCCTGG TCGCGTCACT GGATGTTCAG
GGGCAGGAGA CCGAGGACGA GGCCGTCGAG ATTTCACCAG ACGCTGAGCG AGCTTTGGTA
AAACCGCCAG AGGAACCGGA AGCCCCAAAC AGCCAAGCTC CGAAAAAGAC TGGCTGGGGG
GCGAAGAAGC GCCGACGCCG TCGCAGGTAT TGA
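
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be normalized before any length or GC calculation. A minimal sketch of that step follows. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the sequence is used as sample input, so the value it prints is not the 62% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, exactly as laid out on the page.
first_row = "GTGAACGCCC GGGCTGATTT CTCGAATGCG CGGGCTGTTG TCCGCATTGC TCGGCGCGCC"
seq = clean_sequence(first_row)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this row only
```

Applying `clean_sequence` to all rows joined together should yield a string whose length equals the Gene Length field, which is a quick check that nothing was lost in copying.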
 
Protein sequence
MNARADFSNA RAVVRIARRA RAFLRPPPNL KPSEWAEENI KIPVGNAVPG KMRFDNAPYQ 
REVIDMTADP RCNRISLMWG AQVGKTQTAL AAQAYRIGFN PVSQMMMQPS QGDLTTWLET
KFNPLVEENE DLAELIAKPR GRQGVNNQRM KSYPGGFLMF SWSGSPKTMR GRSAPFIVCD
ETDGYDRTNE GHPVGLLWQR AATFGDQRLL LEISTPTIKG GSWIEKSFEA GDQRYFYVRC
PHCGHLQRLN WSQVTWSKDA DGLHLAETAG YLCVGEGCGT VWSDGERVAA IRNAERDGGG
WIASKPFRGH ASYHLSELYS CFRRLEDIVQ SFLDKKAAGD LQTFVNVSLA ETWEEEGDKL
EASVLMARAA KFAAPVPLGA GVLTAGIDMQ NDRLEVEIVG WGLGEESWSV DYRVLWGDPL
QQDVWDELDA LLSETWEHES GAELRVSAAC LDTGGEGGRT QAAYDYARKR LGRKVWAIKG
VGGWGRPIVT QPSKVKQKGV RPVYLHSIGV DEAKAVVAQR ARISDAGPGH CHFPADRDPA
WFDMFTAEAL RTRYVKGFAV REWHNVRPRN EAFDCRVYAY AALRILRPNI KRLVASLDVQ
GQETEDEAVE ISPDAERALV KPPEEPEAPN SQAPKKTGWG AKKRRRRRRY
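
The gene starts with GTG while the protein starts with M, which is what translation table 11 (the bacterial, archaeal and plant plastid code) gives for an alternative start codon in the initiator position. The sketch below illustrates this with a hand-built codon table rather than a bioinformatics library. The name `translate_cds` is hypothetical, and the function assumes its input begins at the first codon of the CDS.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same mapping as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
# Codons that table 11 accepts as translation starts; each is read as M when first.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(raw: str) -> str:
    """Translate a CDS (whitespace allowed) up to, not including, the first stop."""
    seq = "".join(raw.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is always read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":    # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


first_row = "GTGAACGCCC GGGCTGATTT CTCGAATGCG CGGGCTGTTG TCCGCATTGC TCGGCGCGCC"
print(translate_cds(first_row))  # MNARADFSNARAVVRIARRA
```

The output equals the first 20 residues of the protein sequence listed above (MNARADFSNA RAVVRIARRA).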