Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0020 |
Symbol | |
ID | 4078683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 21907 |
End bp | 23346 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638005307 |
Product | hypothetical protein |
Protein accession | YP_612015 |
Protein GI | 99079861 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0116808 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.937022 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCCG ATTTCACATC CTTCGTGGTG CTGGCAGAGA TGCGCACCGG CTCAAACTTT CTCGAAGCCA ATCTGAATGC TCTTAGAGGC GTGAGCTGTC TGGGAGAGGC GTTTAATCCT CATTTCATTG GCTATCCAAA CCAAGAGTCC ATCTTCGAGG TTTCCAGAGA AGCACGCGAT GACGATCCGC ATGCGCTCGT GGACGCGATC CACGCGGCGG AGGGGCTGTG CGGCTTTCGC TATTTCCATG ACCATGATCC CCGGATCCTT GAAAGGATCC TGAACGACCC GCAGGTTGCC AAGATCATTC TGACCCGGAA CCCGCTCGAC AGCTATGTAT CGTGGAAGAT CGCTCGGACG ACCGGGCAGT GGAAGCTGAC CAATGTAAAG CGACGCAAGG AGGCCAAGGC GGTCTTTGAT GCTGAGGAAT TTGGCGCCCA TGTGGATGGC CTCCAAGAAT TCCAGATCGC CGTCTTGAAC CAATTGCAAC GCACGGGGCA GACGGCTTTT TATGTCGCTT ATGAGGATCT TCAGAGCCTT GAGGTCATGA ATGGTTTGGC GACTTGGCTA GGGGTGGATG CGCGGCTTGA TGCGCTGGAC GACAGTCTAA AACGTCAAAA CCCCGCGCCC GTCATTGCCA AGGTCGAAAA CCCCGAGGAG ATGACCGAGG CCCTTGCGGG GATGGACCGG TTCAACCTCA CACGCACGCC GAATTTTGAG CCACGTCGCG GTCCCGCCGT TCCCGGATAC GTTGCGGGAG TGGTGACGCC GATCCTCTAT ATGCCAGTGC GGAATGGGCT TGAGGCTCAT GTGAGCGAGT GGATCGGGGG GCTCGACAAA GTCCCGGCCG AAGGGTTGAT CACCGGCATG AACCAAAAGC AGCTTCGCCA GTGGCGTCAT GCGAATCCGG GGCACCGCAG TTTCACGGTT GTGCGTCACC CATTGGCGCG TGCGCACCAT GTGTTCTGTA CCAAGATCCT GCCGAATGGG CCCGGAAGCA TGAAGCATAT CCGCAATACG TTGCGGCGCC AATTTCATCT GGATATTCCG CAAGATGGCA TTGATGTGGG CTATTCTCGC GAGAGACACC GTCAGGCGTT TGAGCAGTTT TTGACATTTC TCAAGTCCAA CCTTGCGGGT CAGACCGCGA TCCGGGTGGA TGCACGCTGG GCAAGTCAGG CGCAGAGCAT CAGCGGGTTT GCTGAGTTGG GTGCGCCGGA TCTGATCCTG CGTGAAGAGG ATCTTGCAAC AGATCTACCG TGGCTTGCGC GCAAGCTGGG TCGGATGTCT CCGGCGGAGG TTCCCCCTGT GCCCGCGGAT CAGCCGATCG CACTGGCGGA GATTTACGAT GACGCGCTCG AGGTCCTGTG CCGGTCGATT TACACCCGCG ACTATTTGAC GTTCGGTTTT GACAGTTGGT CGCCCAGGAT CGAGCGCTAG
|
Protein sequence | MSADFTSFVV LAEMRTGSNF LEANLNALRG VSCLGEAFNP HFIGYPNQES IFEVSREARD DDPHALVDAI HAAEGLCGFR YFHDHDPRIL ERILNDPQVA KIILTRNPLD SYVSWKIART TGQWKLTNVK RRKEAKAVFD AEEFGAHVDG LQEFQIAVLN QLQRTGQTAF YVAYEDLQSL EVMNGLATWL GVDARLDALD DSLKRQNPAP VIAKVENPEE MTEALAGMDR FNLTRTPNFE PRRGPAVPGY VAGVVTPILY MPVRNGLEAH VSEWIGGLDK VPAEGLITGM NQKQLRQWRH ANPGHRSFTV VRHPLARAHH VFCTKILPNG PGSMKHIRNT LRRQFHLDIP QDGIDVGYSR ERHRQAFEQF LTFLKSNLAG QTAIRVDARW ASQAQSISGF AELGAPDLIL REEDLATDLP WLARKLGRMS PAEVPPVPAD QPIALAEIYD DALEVLCRSI YTRDYLTFGF DSWSPRIER
|
| |