Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3532 |
Symbol | |
ID | 4075211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 567761 |
End bp | 568780 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638005047 |
Product | LacI family transcription regulator |
Protein accession | YP_611766 |
Protein GI | 99078508 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.895947 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACACC GTTTTCCAAT CAAGGAGATC GCGCGGCAGG CGGGTCTTGG CACCGCCACT GTCGATCGGG TCCTGAATGA CCGGGCGCAT GTGAGCCCGC AGACAAAGCT GCGTGTTACC GCTGCAATAA AAGAGTTGAA GGCCCAAGAG GCCCAGCTTG CTGCTCATGG AAGGCGATTG TTCTTTGACT TCGTCGTTGA AGCGCCATCA CGCTTCAGCC TTGAAGTGAA GGCGGCCGCA GAAGCAGTAC TCCCTCAGAT CGGAACCGCT GTTTGCCGCC CTCGATTTCT GCTGCAGGAG ATCATGGAAG AGGATGAGGT CGTCGGGGCA CTGAAACGGA TCATGAAGCG AGGTAGTCAG GGCGTGTGTC TAAAGGCGCG GGACACGGCG CGGATTAGGG AAGCAGCGAA GACGCTGACC GCCGCAAAAA TCCCCGTGGT CACGCTGGTC ACCGACATCG GGGGTACTGA TCGTCTTGCC TACGTCGGGT TAGACAACGC CGGTGCAGGA CGCACTGCAG CCTACCTTAT CTCCCGAGCG CTTGGGGATG TGCAGGGAAT GGTCTTGGCC ACGCGCAGCC ATGAACGCTT TCTAGGAGAA GAAGAGCGCG AGTTCGCATT TGTCGAAACC TTGGCACGCG AGCGTCCAGG TCTACAGGTA TTTGCTGTCC AGGGCGGTAG TGGAGTGGAC TTTGAAACGT CAAAGCTCTT AACGAAGTCC ATGGTTGGCA TTCACCATCT GCGCGCGGTC TATTCGATGG GGGGTGGCAA CCTATCGATC CTACGCACGC TGGAGCACAA AGGTCTGAGC CCCGATGTGT ACGTGGCCCA TGATCTTGAT CGGGAAAACA GGGAGCTGAT CCAGGACCGG CGCATCGACT TCATCCTGCA TCACGATTTG CAGCTGGACG TACGGAACAC GTTCAACGCC TTTCTATCCT ATCATGGGCT GTCCAGTGGT CTTGTGGGGG CGCCGATCTC CACGGTCCAG GTGCTGACAC CGGAGAATAT ACCGCGTTGA
|
Protein sequence | MTHRFPIKEI ARQAGLGTAT VDRVLNDRAH VSPQTKLRVT AAIKELKAQE AQLAAHGRRL FFDFVVEAPS RFSLEVKAAA EAVLPQIGTA VCRPRFLLQE IMEEDEVVGA LKRIMKRGSQ GVCLKARDTA RIREAAKTLT AAKIPVVTLV TDIGGTDRLA YVGLDNAGAG RTAAYLISRA LGDVQGMVLA TRSHERFLGE EEREFAFVET LARERPGLQV FAVQGGSGVD FETSKLLTKS MVGIHHLRAV YSMGGGNLSI LRTLEHKGLS PDVYVAHDLD RENRELIQDR RIDFILHHDL QLDVRNTFNA FLSYHGLSSG LVGAPISTVQ VLTPENIPR
|
| |