Gene Information |
Locus tag | TM1040_0103 |
Symbol | |
ID | 4078688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 109602 |
End bp | 110837 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638005390 |
Product | hypothetical protein |
Protein accession | YP_612098 |
Protein GI | 99079944 |
COG category | [S] Function unknown |
COG ID | [COG1322] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
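The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # subtract the stop codon
    return gene_len, protein_len


# Values taken from the record above (Start bp, End bp).
print(cds_lengths(109602, 110837))
```

With the values from this record, the result agrees with the listed Gene Length (1236 bp) and Protein Length (411 aa).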
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGGCA TGGAGATTGC CATCGCCAAT ATGAGCGCCG CAGGACCCGT CCTTTGGGCC GCCGGCGCAG GCATGCTTTT GGTGCTTGTG TTGCTGTTTC AATCGTGGCG CACGTCGGCG CGCACCGCCC GCGCACTGGA GCCCCTGAGC CAGCAGATGC ACACGCTCGG GCATGTGGCG CAGCAGCTCT CTGCCGGGCA GGACGCCTTG CGCGGCAACT TGCAGACAGT GTCGGACACT CAGGCGCATG CGCAGATGCA GATCCTTCAG ACCATGGAAG CGCGCCTTGG AGATGTGCAA CAGCGGATGA ACGACCGGCT GGCAGAAAAC GCGATGAAAC AGGCGCGCGC AATGTCCGAG ATGCAGGAGC GCATGGCCGA GAGCCTGCAC GGGAATGCCA AACGCACTGC TACCTCGCTC ACCCAGTTGC AAGAACGGCT TGCGGTGATC GACAAGGCGC AGGACAATAT CACGAAGCTC TCGGGCGATG TGCTTTCGCT GCAGGATATC CTGTCAAACA AGCAGACGCG GGGCGCCTTT GGTGAGATCC AGTTGAATGA CATTGTCTCA AAGGCGCTGC CGAGCGATTC CTATGCATTC CAACACACGC TCTCCAATGG CAAACGCGCG GACTGCCTGA TCCACTTGCC CAACCCGCCC GGGCCCATCG TGATCGACAG CAAGTTCCCG CTCGAGCCCT ATGAAGCGCT GCGCGGCGCT GAAACCCAGG AGGCGCGCGC CCAAGCGGCC CGGCTCCTTA AGGGCGCGCT GCGCAAACAT ATCCGAGACA TCGCAGAGAA ATATATCCTC GAAGGGGAAA CCGCCGACGG GGCGCTGATG TTTCTGCCTT CGGAGGCGGT CTATGCAGAG CTACACGCGA ATTTCTCGGA TGTGGTGCGC GAAGGGTTCT CGCTCAAGGT CTGGATCGTC TCGCCCACCA CATGCATGGC GACGCTGAAC ACGATGCGGG CGATCCTGAA AGATGCCCGC ATGCGCGAAC AGGCGGGCGC CATTCGCCAG GAACTGGGTC TGCTGCACAA GGATGTTGAA CGTCTCGGCG ACCGGGTGGG CAATCTCGAT CGGCATTTCG CGCAAGCCCA ACGGGATATT TCCGATATCA AGATCAGCGC CGACAAGGCT GGGCGACGCG CCCAGCGGCT AGATAATTTT GACTTTGAGG ACCTTAACCC AGAGAGCGTA TCGCGGGTTG TTGCCCTGGA GCACCCGGGC GAATGA
|
Protein sequence | MDGMEIAIAN MSAAGPVLWA AGAGMLLVLV LLFQSWRTSA RTARALEPLS QQMHTLGHVA QQLSAGQDAL RGNLQTVSDT QAHAQMQILQ TMEARLGDVQ QRMNDRLAEN AMKQARAMSE MQERMAESLH GNAKRTATSL TQLQERLAVI DKAQDNITKL SGDVLSLQDI LSNKQTRGAF GEIQLNDIVS KALPSDSYAF QHTLSNGKRA DCLIHLPNPP GPIVIDSKFP LEPYEALRGA ETQEARAQAA RLLKGALRKH IRDIAEKYIL EGETADGALM FLPSEAVYAE LHANFSDVVR EGFSLKVWIV SPTTCMATLN TMRAILKDAR MREQAGAIRQ ELGLLHKDVE RLGDRVGNLD RHFAQAQRDI SDIKISADKA GRRAQRLDNF DFEDLNPESV SRVVALEHPG E
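The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is a minimal illustration, not the pipeline that produced this record. It builds the codon table from the usual compact NCBI layout (bases in the order T, C, A, G), whose amino acid assignments are shared by translation table 11 and the standard code. It does not handle the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG. The function name `translate` is hypothetical.

```python
from itertools import product

# Compact NCBI-style codon table: the first base varies slowest, in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above, in the record's 10-base groups.
print(translate("ATGGACGGCA TGGAGATTGC CATCGCCAAT"))  # MDGMEIAIAN
```

The ten residues printed match the first block of the protein sequence in the record. Passing the full gene sequence should reproduce the whole protein, with the terminal TGA read as the stop codon.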