Gene Information |
Locus tag | TM1040_3344 |
Symbol | |
ID | 4075243 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 355156 |
End bp | 356160 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638004852 |
Product | LacI family transcription regulator |
Protein accession | YP_611578 |
Protein GI | 99078320 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
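The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (the record does not state its convention) and a CDS that ends with one stop codon; the function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(355156, 356160)
print(length, protein_length(length))  # 1005 334
```

Both values agree with the Gene Length and Protein Length rows above.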
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000247508 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.28725 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGTTA CGTTGAAAGA GGTTGCAGAA CGTGCGGGGG TTTCCCGTTC TGCTGTATCG CGCACCTTTA CCGACGGGGC CTCTGTGTCC GACAAGATGC GGCGCAAGGT TGAAAAGGCC GCCCGCGAAC TGGGCTACAG CCCAAATGCG CTGGCCTCAT CGCTGACAAC AGGGCGCACC AAGCTGATCG GGCTTGTCTC GAACAACTTT CACAACCCCA TCTTTCTCGA GGTCTTTGAT CTCTTCACGC GCGGCCTTCA GGATCGGGGC TTGCGTCCAT TGCTTGTGAA CCTGACCGAT GAAACAGACC CCGAGCATTC TGTGAATATG CTGCGCCAGT ATTCCGTCGA TGGGGTGGTT GTGGCCTCGT CCACGTTGCC CCCGGGGTTT GCCAAGGCCT TTCGCGACGC AGGCGTGCCC GTGGTTCATA GTTTTGGCAG ATCATCCTCG GCGCCTCAGG TGCATGTTGT TGGGATCGAC AACGTTGAAT CGGGGCGCAT GGCGGCACGC GCGCTCATTG CGCGGAACTA TACGCATGTG GCCTTTATGG GCGGTCCGGA AACTGCGACC TCCACGCAGG ATCGCCATGC AGGCTTCATG TCGGAAATGT CCAAACACCC CAACATCCGG GCGACATACT CCTTCGCCGA GGCCTATTCC TTTCAGGCGG GACGCTCCGA GATGATGAGG CTTCTACAGG CTGGGCCTGC CGAGGCGTAT TTCTGTGGCG ACGACGTCCT CTCGATTGGT GCGCTCTCGG CCATCTCAGA CAGCGGCCTC AGCGTACCGA AGGACATCGG CATCATCGGA TTGAATGATA TGGAAATGGC TGGTTGGGAG AGCATCGACC TGACAACGGT TCATCAGCCG ATCCGACAAA TCGTCTCTTC ATCCATTGAA TTGATGGTGG CGATGCTCGA TGAGCCGGAC CGCTATCCCG AGGCCCGCAT CTTTCCCTGT TCGATTGTCG AGCGGGGCAC ATTGCGCCCC GCCCCCAAAA CCTAG
|
Protein sequence | MAVTLKEVAE RAGVSRSAVS RTFTDGASVS DKMRRKVEKA ARELGYSPNA LASSLTTGRT KLIGLVSNNF HNPIFLEVFD LFTRGLQDRG LRPLLVNLTD ETDPEHSVNM LRQYSVDGVV VASSTLPPGF AKAFRDAGVP VVHSFGRSSS APQVHVVGID NVESGRMAAR ALIARNYTHV AFMGGPETAT STQDRHAGFM SEMSKHPNIR ATYSFAEAYS FQAGRSEMMR LLQAGPAEAY FCGDDVLSIG ALSAISDSGL SVPKDIGIIG LNDMEMAGWE SIDLTTVHQP IRQIVSSSIE LMVAMLDEPD RYPEARIFPC SIVERGTLRP APKT
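The gene and protein sequences above can be related with a short translation routine. This is a minimal sketch, not the database's own pipeline: it builds the codon table by hand (the amino-acid assignments of translation table 11 match the standard code), strips the spaces between the 10-base blocks, and does not handle alternative start codons, which does not matter here because the gene begins with ATG. All function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
prefix = "ATGGCCGTTA CGTTGAAAGA GGTTGCAGAA"
print(translate(prefix))  # MAVTLKEVAE
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the Protein sequence and GC content rows.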