Gene Information |
Locus tag | TM1040_2279 |
Symbol | |
ID | 4078463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2395984 |
End bp | 2397486 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638007601 |
Product | hypothetical protein |
Protein accession | YP_614273 |
Protein GI | 99082119 |
COG category | [S] Function unknown |
COG ID | [COG4642] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| |
Plasmid Coverage Information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.416244 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGTA TCAACTCTAC TCTGCTCCTG ACCTCCCTGC TCTCTTTGAA TGCCGCAGGG CTCAGCGCCC CCGCCTTTGC GCAGGACGAA CAGGTTCTGA CCACACAAGA CGAGATCGGT GGCGTCTATG AGGGCGAATT CAAGGGCGGC CTGCAACATG GTCAAGGGAC CTATAAGCTG CCAAACGCCT ATGAATATTC CGGCCAGTGG GTCGAAGGCG AGATCAAGGG TAAGGGGGTT GCCCGTTTCC CAAATGGATC AGTCTACGAG GGTGAGTTTT CCAAGGGGAA ACCCGAAGGT CTGGGCAAGA TCACCTTTGC CGATGGCGGC ACCTATGAAG GCGAGTGGCA AGACGGTGTG ATCAATGGCC AAGGCATTGC GATCTATGCC AATGGGGTGC GCTACGAGGG GTCTTTTGTG GACGCCAAAC ATGACGGGCG CGGGGTGATG CAAAACCCCG GCGGCTACCA ATACGAGGGC GATTGGGTTG CCGGGCGCAA GGAAGGCACT GGCAAGATCA CCTACCCCGA TGGCACCACC TATCAGGGCG GCGTCAAGGA CGGCAAGCTG CATGGTCTGG GGACGCTGGT GATGCCTGAT GGCCTTAAAT ACGAGGGCGA ATGGGCCGAC GATCAGATGA ATGGCACCGG CGTCCTGACG CAGCCCAATG GCGACGTCTA CGAGGGCCCG CTGGTCAACG GTCGTCGTCA GGGCGAGGGC GTGCTGCGCT ATGCCAATGG CGATGTCTAC GAGGGCCAGT TCGACGATGA TCTGCGTCAG GGCGAGGGCA CCTTTACTGG CACCGACGGC TATATCTACA GCGGTCAGTG GCAGGCCGGT CAGATCGAGG GTCAGGGCAA GGTCACCTAC CCGGATGGGT CCGTCTATGA GGGCGAATTC CGCGATGATC TGGCGCATGG GGTTGGCAAG ATCACTTACC CCGATGGCTC CACCTACGAG GGCGAATGGG TCGCTGGCGT GATCGAAGGC AACGGCAAGG CGACCTACGC CAATGGCGCC ATCTATGAGG GCAGCTTCAA GAACGCCAAA AACGACGGTC AGGGCGTAAT GACATCGCCC GAAGGCTATC GTTACGAGGG CGGCTGGAAG GACAGCCTGC GCCATGGCGA GGCCAAGGTG ACCTATGCCG ATGGATCGGT CTATGAGGGC GCGTTTGCAA ATGGCCAGCG CCATGGCTTT GGCAAGATCA CCCGCCCAGA CGGGTTCAGC TACGAAGGCC AATGGGTCGA AGGCAAGATC GAAGGCGAAG GCATTGCGAC CTATGCCAAC GGCGACATCT ACGAGGGCAG CTTTGTGGGG TCCAAACGTC AGGGCCCCGG CACCATGCGC TATGCCTCCG GCCAGGAGGC CTCGGGCACT TGGAACAATG GCGCGCTTAC CACACCAGAT GCCGCGGCCT CTGAGGCGGA TCAGAGCACG GATCCGGCCG CCGAGGAGAC GCCTGACGCA GAGGCAGGCT CGGCTGGGGA CGAAAGCAAC TAA
|
Protein sequence | MIRINSTLLL TSLLSLNAAG LSAPAFAQDE QVLTTQDEIG GVYEGEFKGG LQHGQGTYKL PNAYEYSGQW VEGEIKGKGV ARFPNGSVYE GEFSKGKPEG LGKITFADGG TYEGEWQDGV INGQGIAIYA NGVRYEGSFV DAKHDGRGVM QNPGGYQYEG DWVAGRKEGT GKITYPDGTT YQGGVKDGKL HGLGTLVMPD GLKYEGEWAD DQMNGTGVLT QPNGDVYEGP LVNGRRQGEG VLRYANGDVY EGQFDDDLRQ GEGTFTGTDG YIYSGQWQAG QIEGQGKVTY PDGSVYEGEF RDDLAHGVGK ITYPDGSTYE GEWVAGVIEG NGKATYANGA IYEGSFKNAK NDGQGVMTSP EGYRYEGGWK DSLRHGEAKV TYADGSVYEG AFANGQRHGF GKITRPDGFS YEGQWVEGKI EGEGIATYAN GDIYEGSFVG SKRQGPGTMR YASGQEASGT WNNGALTTPD AAASEADQST DPAAEETPDA EAGSAGDESN
|
| |
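The length fields in this record can be cross-checked against its coordinates. The sketch below is a minimal illustration, not part of the source database. It assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon, neither of which the record states explicitly. All function names (`gene_length`, `protein_length`, `strip_blocks`, `gc_percent`) are hypothetical helpers introduced here for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, ASSUMING the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


def strip_blocks(sequence_field: str) -> str:
    """Join the space-separated 10-character blocks into one sequence string."""
    return "".join(sequence_field.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


if __name__ == "__main__":
    # Coordinates taken from the Gene Information fields of this record.
    length_bp = gene_length(2395984, 2397486)
    print(length_bp, protein_length(length_bp))  # prints "1503 500"
```

Under these assumptions the computed values match the Gene Length (1503 bp) and Protein Length (500 aa) fields. Passing the Gene sequence field through `strip_blocks` and then `gc_percent` would give a figure to compare with the reported GC content, which the record appears to round to a whole percent.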