Gene TM1040_1625 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1625 
Symbol 
ID4077727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1732375 
End bp1734075 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content60% 
IMG OID638006938 
Productphage terminase 
Protein accessionYP_613620 
Protein GI99081466 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000276041 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.989877 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGATG CGTTCATTGA TCCTGCTGAG CGGGCGGCTT GGTCCACGGC GGTTCCGGAC 
TGGGAAGAGC GGATCCTCAA TCGGCAATCG CTGATCCCGG ATCTTCCGTT GTGGGACGAG
CCAGCTGAGC GGGCGCTGCG GATCTTCAAA CGGCTTCGGG TTCCCGATCT TATCGGGCAA
CCTACCTACG GCGAGGTCAG CGATCAGTGG GTCTTTGATC TGGTGCGTGC CATCTTCGGC
AGCTACGACC CAGTCAAGAA ACGGCGGATG CTGCGGGAAT TTTTCCTGCT GATCCCAAAG
AAGAACGGGA AGTCGGCGAT CGCGGCGGCG ATCATCCTGA CCGCCTGCAT CATGAATGAA
CGGCCTGAAG CGGAGCTGCT GCTGATCGCC CCTACGATGA CGATCGCCAA GATCTCCTTC
AAACAGATCA AGGGGATCAT CCGGGCCGAT CCGGAGCTGG ACAAGCGGTT CCATATTCAA
GATCACGCCC GGATGATCAC GCATCTGGTC AGCAAGGCGG AGATCTCGGT AAAGGCCGCT
GATGGGGACG TCATCACCGG TGGCAAGGCC ACCTATACGA TGATTGACGA AACCCACGAG
TTTGCCCGCA AGAGCAAGGC GGACGGAGTT TTTCTCGAAC TGCGGGGGGC ATTGGCATCC
CGGCCTGAAG GGTTTGTGAT GCAGATCACT ACCCAATCAA AGGAACAGCC CGCCGGCGTG
TTCAAGGCTG AGCTCGAGAC CGCGCGTGCC GTGCGGGACG GGCGGTTGCA GTCGCCGATG
TTGGCCGTGC TCTATGAGTT GCCCAAAAAG CTGGCCAAAA GCTGGCAGAA GCAAGAGACT
TGGGCGCTGG TCAATCCGCA CCTCGGCCGA TCTGTCGATC CGGCCTTCCT GCAGGACCAG
CTGGTCAAGG CGCGTGAAAA AGGGCCGAAA GAGCTGCAGC TGTTGGCCTC TCAGCACTTC
AACGTCGAAA TCGGCGTCGG CCTCGGTGGC GGATGGACCG GCGCGCACTA TTGGAAGAAA
GCAGGGCCGC AGACGTTCGG CCTTGATGAG TTGATTGCCC GGTCTGACGT AGCGGTTGTT
GGGCTAGACG GCGGCGGTCT GGATGACCTG TTTGGGCTGG CCGTGGTCGG GCGCGAGATC
GAGACCAAAA ATTGGTTGAT GTGGTTCCAC GCCTGGGCGC ATCCAGAAGT GCTGCGGGTG
CGTAAGGAGA TTGCGTCGCG TCTGGGTGAT TTCGCCAAGG CTGGCGATCT TATTCTACTG
GGTGAGGACG AGCCAACGGG AGATATCGAG GGCGCAGCGC GGATTGTTGG CAAACTTCTC
GAGGCGCAGT TGCTGCCGGA GGAGGCCGCA ATCGGGCTGG ATACGGTGCA GGTCTACGCG
ATCCTCGAAG AATTGATGTC GATCGGTGTC GCGGAAGATC AGCTACGCAA CATCGGTCAA
GACTGGCGCT TGTCACCGGC GATCTGGGGC ATGGAGCGGA AGCTGAAAGA CGGCACGCTG
TTGCACAGCG GGCAACCGAT GATGGAGTGG GTGCTTGGCA ACGCCAAGGT TGAACAGCGC
GGTTCTGCCG TGCGGATGAC CAAAGAGGCC GCGGGGCGGG CCAAGATCGA CCCCCTGATT
GCCGGCATGA ACGCCTTTAC CTTGATGAGC CGCAATCCGG TTGCGGCGGG GTCCAAGACC
TTCGTTTACA ACGGGATGTG A
 
Protein sequence
MLDAFIDPAE RAAWSTAVPD WEERILNRQS LIPDLPLWDE PAERALRIFK RLRVPDLIGQ 
PTYGEVSDQW VFDLVRAIFG SYDPVKKRRM LREFFLLIPK KNGKSAIAAA IILTACIMNE
RPEAELLLIA PTMTIAKISF KQIKGIIRAD PELDKRFHIQ DHARMITHLV SKAEISVKAA
DGDVITGGKA TYTMIDETHE FARKSKADGV FLELRGALAS RPEGFVMQIT TQSKEQPAGV
FKAELETARA VRDGRLQSPM LAVLYELPKK LAKSWQKQET WALVNPHLGR SVDPAFLQDQ
LVKAREKGPK ELQLLASQHF NVEIGVGLGG GWTGAHYWKK AGPQTFGLDE LIARSDVAVV
GLDGGGLDDL FGLAVVGREI ETKNWLMWFH AWAHPEVLRV RKEIASRLGD FAKAGDLILL
GEDEPTGDIE GAARIVGKLL EAQLLPEEAA IGLDTVQVYA ILEELMSIGV AEDQLRNIGQ
DWRLSPAIWG MERKLKDGTL LHSGQPMMEW VLGNAKVEQR GSAVRMTKEA AGRAKIDPLI
AGMNAFTLMS RNPVAAGSKT FVYNGM