Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1625 |
Symbol | |
ID | 4077727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1732375 |
End bp | 1734075 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638006938 |
Product | phage terminase |
Protein accession | YP_613620 |
Protein GI | 99081466 |
COG category | [R] General function prediction only |
COG ID | [COG4626] Phage terminase-like protein, large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000276041 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.989877 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGATG CGTTCATTGA TCCTGCTGAG CGGGCGGCTT GGTCCACGGC GGTTCCGGAC TGGGAAGAGC GGATCCTCAA TCGGCAATCG CTGATCCCGG ATCTTCCGTT GTGGGACGAG CCAGCTGAGC GGGCGCTGCG GATCTTCAAA CGGCTTCGGG TTCCCGATCT TATCGGGCAA CCTACCTACG GCGAGGTCAG CGATCAGTGG GTCTTTGATC TGGTGCGTGC CATCTTCGGC AGCTACGACC CAGTCAAGAA ACGGCGGATG CTGCGGGAAT TTTTCCTGCT GATCCCAAAG AAGAACGGGA AGTCGGCGAT CGCGGCGGCG ATCATCCTGA CCGCCTGCAT CATGAATGAA CGGCCTGAAG CGGAGCTGCT GCTGATCGCC CCTACGATGA CGATCGCCAA GATCTCCTTC AAACAGATCA AGGGGATCAT CCGGGCCGAT CCGGAGCTGG ACAAGCGGTT CCATATTCAA GATCACGCCC GGATGATCAC GCATCTGGTC AGCAAGGCGG AGATCTCGGT AAAGGCCGCT GATGGGGACG TCATCACCGG TGGCAAGGCC ACCTATACGA TGATTGACGA AACCCACGAG TTTGCCCGCA AGAGCAAGGC GGACGGAGTT TTTCTCGAAC TGCGGGGGGC ATTGGCATCC CGGCCTGAAG GGTTTGTGAT GCAGATCACT ACCCAATCAA AGGAACAGCC CGCCGGCGTG TTCAAGGCTG AGCTCGAGAC CGCGCGTGCC GTGCGGGACG GGCGGTTGCA GTCGCCGATG TTGGCCGTGC TCTATGAGTT GCCCAAAAAG CTGGCCAAAA GCTGGCAGAA GCAAGAGACT TGGGCGCTGG TCAATCCGCA CCTCGGCCGA TCTGTCGATC CGGCCTTCCT GCAGGACCAG CTGGTCAAGG CGCGTGAAAA AGGGCCGAAA GAGCTGCAGC TGTTGGCCTC TCAGCACTTC AACGTCGAAA TCGGCGTCGG CCTCGGTGGC GGATGGACCG GCGCGCACTA TTGGAAGAAA GCAGGGCCGC AGACGTTCGG CCTTGATGAG TTGATTGCCC GGTCTGACGT AGCGGTTGTT GGGCTAGACG GCGGCGGTCT GGATGACCTG TTTGGGCTGG CCGTGGTCGG GCGCGAGATC GAGACCAAAA ATTGGTTGAT GTGGTTCCAC GCCTGGGCGC ATCCAGAAGT GCTGCGGGTG CGTAAGGAGA TTGCGTCGCG TCTGGGTGAT TTCGCCAAGG CTGGCGATCT TATTCTACTG GGTGAGGACG AGCCAACGGG AGATATCGAG GGCGCAGCGC GGATTGTTGG CAAACTTCTC GAGGCGCAGT TGCTGCCGGA GGAGGCCGCA ATCGGGCTGG ATACGGTGCA GGTCTACGCG ATCCTCGAAG AATTGATGTC GATCGGTGTC GCGGAAGATC AGCTACGCAA CATCGGTCAA GACTGGCGCT TGTCACCGGC GATCTGGGGC ATGGAGCGGA AGCTGAAAGA CGGCACGCTG TTGCACAGCG GGCAACCGAT GATGGAGTGG GTGCTTGGCA ACGCCAAGGT TGAACAGCGC GGTTCTGCCG TGCGGATGAC CAAAGAGGCC GCGGGGCGGG CCAAGATCGA CCCCCTGATT GCCGGCATGA ACGCCTTTAC CTTGATGAGC CGCAATCCGG TTGCGGCGGG GTCCAAGACC TTCGTTTACA ACGGGATGTG A
|
Protein sequence | MLDAFIDPAE RAAWSTAVPD WEERILNRQS LIPDLPLWDE PAERALRIFK RLRVPDLIGQ PTYGEVSDQW VFDLVRAIFG SYDPVKKRRM LREFFLLIPK KNGKSAIAAA IILTACIMNE RPEAELLLIA PTMTIAKISF KQIKGIIRAD PELDKRFHIQ DHARMITHLV SKAEISVKAA DGDVITGGKA TYTMIDETHE FARKSKADGV FLELRGALAS RPEGFVMQIT TQSKEQPAGV FKAELETARA VRDGRLQSPM LAVLYELPKK LAKSWQKQET WALVNPHLGR SVDPAFLQDQ LVKAREKGPK ELQLLASQHF NVEIGVGLGG GWTGAHYWKK AGPQTFGLDE LIARSDVAVV GLDGGGLDDL FGLAVVGREI ETKNWLMWFH AWAHPEVLRV RKEIASRLGD FAKAGDLILL GEDEPTGDIE GAARIVGKLL EAQLLPEEAA IGLDTVQVYA ILEELMSIGV AEDQLRNIGQ DWRLSPAIWG MERKLKDGTL LHSGQPMMEW VLGNAKVEQR GSAVRMTKEA AGRAKIDPLI AGMNAFTLMS RNPVAAGSKT FVYNGM
|
| |