Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3311 |
Symbol | |
ID | 4075716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 320075 |
End bp | 321163 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638004819 |
Product | LacI family transcription regulator |
Protein accession | YP_611545 |
Protein GI | 99078287 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.472161 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.750741 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACA GGACCCGCAT AACCATCAAG GATGTCGCCC TCGCTGCCGG ATGTGGCGTC GCAACTGCGA GTCGCGTGCT GAATAAATCC GGCCCCGCCA GCGCTGAGAC CCGCGCTCGT GTTGAAGATG CAGCGCGGCG CATGGGTTTT GTCTTTTCCG CGACTGGACG CGCCTTACAA AGCCGCAAGA GCATGACCGT CGGTTGCCTT ATTCCGTCGC TCGCCAACCC GGTGTTTGCG GAGGCCGTGC AGGGGGCTCA GGAAGAGTTG CGCAGCCACG GGTATCAGCT GTTGGTCGCC AGCTCCAATT ACGATGACGA AACGGACAAT GACATTCTCA CAACTCTCCT GAGTAAGGAT GTCGACGGCC TGTTGGTGAC AATGGCCGCG CCACAGGAGA GTGTGCCGCT CGCACAGGCT CGGGCGCGCG ACATCCCCGT CTGCCTGATG TTTCACGACC CTCTGCCCGA CTGGCCCAGC GCGCATGTGT CCAACGCGCA GGCCGCCGCA GAAGTCGCGC GCCAGTTTGC CCTCTACGGC CACGAGCGCA CCGGATTTCT GGCGTTGCGA TTTTCGACCT CGGACCGCTC ACGCAACCGG TTTGACGGGT TTCGGGCGCA ATGTGCCGCC CTCAATCTGG CCCCACCCAA GCTGATCGAG ATCACCGAGA CCGAAGCCAA CACGCCCAGT ATCCTTGCCC AGCGCCTCTC GGACCACCCG GATCTGACCG CGATCTTTGC CTCCAATGAC TTCCTGGCAA TCGCCGTGCA AAAAGCCGCG CCTTTGATGG GGCGGCATGT CCCGCAGGAT TTGTCGGTTG TGGGGTTCGA TGGGATCGAG GTCGGGCGTT TGCTGGATCG CTCGCTTGCC ACGATCGAGA CCACCCCAGA GGCAATGGGT CGCCAGGCCG CACAGACGCT TTTGACCGGA CTGCAGGGCG GCGCAATGAC AGAACTTGCG CCCCTGCCTT TTACTTTCCG CGCCGGGGCA ACCTTGTCCG GACCCCGCGC GAAAAGCCCT GACGACGACC GGGGTGCTGC CCAGTCGCCG TCTGTTCCCC CGTTCACGTC CAACGACAAA CAAGGATGA
|
Protein sequence | MTDRTRITIK DVALAAGCGV ATASRVLNKS GPASAETRAR VEDAARRMGF VFSATGRALQ SRKSMTVGCL IPSLANPVFA EAVQGAQEEL RSHGYQLLVA SSNYDDETDN DILTTLLSKD VDGLLVTMAA PQESVPLAQA RARDIPVCLM FHDPLPDWPS AHVSNAQAAA EVARQFALYG HERTGFLALR FSTSDRSRNR FDGFRAQCAA LNLAPPKLIE ITETEANTPS ILAQRLSDHP DLTAIFASND FLAIAVQKAA PLMGRHVPQD LSVVGFDGIE VGRLLDRSLA TIETTPEAMG RQAAQTLLTG LQGGAMTELA PLPFTFRAGA TLSGPRAKSP DDDRGAAQSP SVPPFTSNDK QG
|
| |