Gene TM1040_1671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1671 
Symbol 
ID4075774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1770902 
End bp1772683 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content59% 
IMG OID638006984 
Productphage terminase 
Protein accessionYP_613666 
Protein GI99081512 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCCT CGGAGCGCAT TGACCAGACC ACGCGATACG CGCGGGATGT GGTTTCGGGA 
AAGATCGTAG CGGGCGCATT CGTGCGGGCG CAATGCCAGC GGCACCTAGA TGATCTGAAG
CATGGCCCGG CGCGCGGGTT GCAGTGGGAT ATTGAGCAGG CCGAGCGGGC CATTCGGTTC
TTTCCGGCCA TGTTGTCGAT CACCGAAGGG GCCAAGGAGG GCGAGCCATT CAAGCTGCTG
CCCTGGCACT TGTTCGTGGT AGGGTCGATT TTCGGCTGGC GAACAGCGGA AGGTTTCATC
CGGTTCCGGT TCGTGTGGCT GGAAACCGGC AAGGGGCAAG CGAAATCGCC GCTCATGGCG
GCTGTAGGTA TCTATCTCAG CGGGTTCTAT GGCCGGAAGC GGGCCGAGGT CTACTGCATC
GGGGAAACGA AGGACACGGC GCGGGTTATG TTCCGCGATG CGGTGGCGAT GCTTCGGGCG
CCGATCCCAG GCAAAGGCGG CATGACGCTG GAAGACGCGG CGTTTGTGAT CCGCGGCACC
GGCGACCTTG CCTACAGCGT CGAGCATCCC GACAGCGGAT CATTCATGCG GCCCATCGCG
AACAATGACA GCGTTTCGGG GCCAAAGCCG ATCCTTGTGG CTGGCGATGA GATCCACGAG
ATGAAAAGCG GCAAGGCGAT TGAGATGTGG CGGGCCGCAG TCACCAAGAA ACACGGTGAC
AGCATACTGA TGCTTGGAAC GAACACCCCA AGCGCAGACC AGCAGGTGGG GACGGACTAC
AGCGAGTTTT GCCAGAAGGT GGTGACCGGC GACTTCACTG ACGACAGCGT GTTTGCATAT
ATCGCGCGGG TGGACGAGGG CGACGACCCG CTGGAAGACG AAAGCTGCTG GGTTAAGGCG
TTGCCCGCCC TGGGCATCAC CTATCCGGTG GACAACGTGC GCAAGCTGGT AGTGACGGCG
AAACAGCAGA TCAGCACGCA GCTGACCACA AAGCGCCTGT ATTTCGGAAT CCCGGTAGGC
TCTGCAGGGT TCTGGACCTC TGAGCAGGCA TGGAAGGAAG TGCAGGGCAA GGTCGATGAT
GCGAAGATGA TCGGCCGCCG GGCGCATTTG GCGCTCGACC TCTCCGAGAA GAACGACCTC
ACCGCGCTCG CGGTGGCGTG GGAGGGCGAA AGGATCGATG TCAAATCATG GTATTGGACT
CGTGAATTTG AGATCGAAGA GCGATCAACG GCAGATGCGA TCCCTTATCG CGAGTTGGAA
GCGGCGGACC TGATAGAAGT CACGCCGGGG CGCGTAATCG ACTACACGTT CATTGCGGCG
AAGATTATCG ACTTTTGTGG ACGCCACTCA GTGGTGCAGA TGGCTATCGA CAGCGCCCAC
ATGGAAAAGC TCTGCGAGGC CTTCGATAAG GCGGGTTTCG CGTATTGGAT CGAAGAAGGC
GACGACAAGC CGGGCAGCGG CCTGAAAATT GTCAGGCATA AGCAAGGCAC GAACGTCAGC
TTCGACGGAA AGTTCCTCTG TATGCCGACT TCGATCACAC AGCTTGAGGA CCACATGCTG
AAAGGCACGG TCAAGATCGA TCGCAACAAG CTCACCAGCA TCTGCGCGCG CAACGCGATC
ATCCGCGAGG ATGGTTTCGG CAACCGGATG TTCGACAAAT CACGTTCGCG GGGCCGGATC
GATGGGGTTG TCACCCTAGC GATGGCCGTA GGGTCAGCCA CCGCGGCGAT GAAAACGAAG
TCCGGATTGG ACGACTATTT TGCTTCACTT GGGGGCGGCT AA
 
Protein sequence
MSSSERIDQT TRYARDVVSG KIVAGAFVRA QCQRHLDDLK HGPARGLQWD IEQAERAIRF 
FPAMLSITEG AKEGEPFKLL PWHLFVVGSI FGWRTAEGFI RFRFVWLETG KGQAKSPLMA
AVGIYLSGFY GRKRAEVYCI GETKDTARVM FRDAVAMLRA PIPGKGGMTL EDAAFVIRGT
GDLAYSVEHP DSGSFMRPIA NNDSVSGPKP ILVAGDEIHE MKSGKAIEMW RAAVTKKHGD
SILMLGTNTP SADQQVGTDY SEFCQKVVTG DFTDDSVFAY IARVDEGDDP LEDESCWVKA
LPALGITYPV DNVRKLVVTA KQQISTQLTT KRLYFGIPVG SAGFWTSEQA WKEVQGKVDD
AKMIGRRAHL ALDLSEKNDL TALAVAWEGE RIDVKSWYWT REFEIEERST ADAIPYRELE
AADLIEVTPG RVIDYTFIAA KIIDFCGRHS VVQMAIDSAH MEKLCEAFDK AGFAYWIEEG
DDKPGSGLKI VRHKQGTNVS FDGKFLCMPT SITQLEDHML KGTVKIDRNK LTSICARNAI
IREDGFGNRM FDKSRSRGRI DGVVTLAMAV GSATAAMKTK SGLDDYFASL GGG