Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0801 |
Symbol | |
ID | 4076073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 848343 |
End bp | 849593 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638006099 |
Product | hypothetical protein |
Protein accession | YP_612796 |
Protein GI | 99080642 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01547] phage terminase, large subunit, PBSX family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.887797 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000709779 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCACACG ATCTGACGGT TAGCGCGCCG CAGGGTGTTT TCCTAAGCGG CCTCAACACC AAGTTTCGCG CCTACGTTGG CGGGTTCGGG TCTGGCAAGA CCTATGTGGG GTGCCTCGAC CTCGGCTTGT TTGCAGGGCA GCACCCCAAG ACGGTTCAGG GATATTTCGC CCCGACGTAT CGGGACATTC GAGACACCTT CTGGCCAACC GTAGACGAGG CCGCGCATTC GCTGGGGTTC ACGACCAAGG TCAAGAGCGC CGACAAGGAG GTCGAGTTCT ACCGGGGCCG CAGCTACTAC GGCACCACCA TTTGCCGATC GATGGATGAT CCGGGCGGCA TTGTGGGCTT CAAGATCGCT CGCGCCCTGG TCGATGAGAT CGACATTCTC AGCAAGGACA AGGCGCAAGC CGCCTGGCGT AAGATCATCG CCCGGATGCG CCTGGTTCTC CCCGGCGTGG TCAACGGCAT CGGCGTCACC ACCACCCCCG AGGGGTTCCG GTTCGTCTAT GACAGCTTCA AGCGGGAGCC AAAGAGCAAC TATTCGATGG TGCAGGCCAG CACCTACGAG AACGAGGCGT TCCTGCCGCC AGACTACATT TCAACCCTGC TGGAGGACTA CCCCGAGGAG CTGATTAAGG CCTACCTCAT GGGGGAGTTC GTCAACCTCA CGAGCGGCAC CGTCTATCGC AGTTATGACC GGTTGCGGCA TCGATCAACA CAGAGCATCC AGCCGCGGGA GCCGCTGCAC ATTGGGCAGG ACTTCAATGT TGGCAACATG GCCTCGGTGG TTTTCGTCCA GCGCGGCGAA GATTGGCACG CGGTCGATGA GCTGCAGGGG CTGCAGGACA CGCCGCATCT GATCGAGGTT CTATGCGACC GATACGAGGG GCACCACCTC ACGATCTACC CCGACGCCAG CGGTAGCAGC CGCAAGACTG TCAATGCCAG CACGTCGGAT ATTGAGCTTC TGCGGAAAGC GGGTCACGCG ATCCGGGCGC CTAGCACCAA CCCGGCGGTG AAAGACCGGA TCCTCGCAGT GAATACGGCC TTCGAGAATG GCCGCCTCTT TGTGAACGCT CTCCGCTGCA AAGCCTACGC CGAAGCGCTT GAACAGCAGG CATATGACAA GAACGGCGAG CCGGACAAAT CCGCCGGTCT CGACCACCAC CCAGACGCGG GCGGCTATTT CGTCCACCAG AAAATGCCGG TCGTGAAACC GACCTTCACC CGGCAGGAGC TTCGCCTTTG A
|
Protein sequence | MSHDLTVSAP QGVFLSGLNT KFRAYVGGFG SGKTYVGCLD LGLFAGQHPK TVQGYFAPTY RDIRDTFWPT VDEAAHSLGF TTKVKSADKE VEFYRGRSYY GTTICRSMDD PGGIVGFKIA RALVDEIDIL SKDKAQAAWR KIIARMRLVL PGVVNGIGVT TTPEGFRFVY DSFKREPKSN YSMVQASTYE NEAFLPPDYI STLLEDYPEE LIKAYLMGEF VNLTSGTVYR SYDRLRHRST QSIQPREPLH IGQDFNVGNM ASVVFVQRGE DWHAVDELQG LQDTPHLIEV LCDRYEGHHL TIYPDASGSS RKTVNASTSD IELLRKAGHA IRAPSTNPAV KDRILAVNTA FENGRLFVNA LRCKAYAEAL EQQAYDKNGE PDKSAGLDHH PDAGGYFVHQ KMPVVKPTFT RQELRL
|
| |