Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3831 |
Symbol | |
ID | 5077979 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | - |
Start bp | 484991 |
End bp | 486013 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640481554 |
Product | LacI family transcription regulator |
Protein accession | YP_001166216 |
Protein GI | 146276056 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.167541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCACG CCACAGACGC CGACGAGGCC GCCGATACCC GGGCACCGAC GCTCGATGAC GTGGCCAGCC GCGCGGGCGT TTCGACCGCA ACCGTCAGTC GCTTCGTCAA CAACCCGTCG GTCGTCGCCG CCGCGACGGC CGAGCGCATC CGCGATGCGA TTGCCGCGAC CGGCTATATC CCGAACCTGC TGGCGGGCGG GCTTGCGTCG AGCAAGTCCA AGATGGTCGC GGTGCTGATC CCCTATCTCA CCGACTCCAT CTTCAACGAC ACGATCCAGG CGATGGTGGA GGAACTCTCC GCCGCAGGGA CGACGGTGAT GCTGGGGCTG ACCGGCGTTT CGGAATCGCG GACCGAAGAC CTGATCCGCG CAGCATTGAG CCGCCGGGTC GATGCCATCA TCTTCACCGG ACCGGTGACG CCTCAGGTGG AACAGATGGT GCGGCGTTCG CCGGCGCTGT TCATCCAGGT GTGGGACCTG CCCGAAGATC CCATCGGCAT CGCGGTCGGC TTCAGCCACG AGGCGGCAGG CAGCGCGGTG GCGCGGTTCG TCATCTCGCG CGGCTACCGC AGGCCCCATC TCGTCACGGC GAAAGGATCG CGCGCGCAGA TGCGGCGCAC CGGATTCGTC GGGGCGTGGG AAGCGGAGGG CTCGGGGCCG TTCACCGAAT CCTCGGTCGA AGTGCCTTCG CGCTTTGGCC ACGCGCGGCG CATCTTTGCA GAGATCCGCC GCCTTCCGGA AATGCCGGAC GTGGTGGTGT GCGGGTCTGA CCATCTGGCG CAGGGCGTGA TTGTCGAGGC GCTTGCCGCG GGCCTCAAGG TGCCCGAGGA CATCGCCGTG GTCGGCTTTG GCAACAGTTC GATAGCGGGC GAAATGCGAC CGACGATCAC GTCCGTCGAA GTTGACGGCG CGCGGATCGC CCGCGAGGCC ATCGCTGCAA TCCGGCGCAA GGCCGACAAG TCCACCCCGC CCGAGCGGTG GATTGACGTC GGCTTCCGCT TGATCGCGCG CGAAAGCGCA TAA
|
Protein sequence | MNHATDADEA ADTRAPTLDD VASRAGVSTA TVSRFVNNPS VVAAATAERI RDAIAATGYI PNLLAGGLAS SKSKMVAVLI PYLTDSIFND TIQAMVEELS AAGTTVMLGL TGVSESRTED LIRAALSRRV DAIIFTGPVT PQVEQMVRRS PALFIQVWDL PEDPIGIAVG FSHEAAGSAV ARFVISRGYR RPHLVTAKGS RAQMRRTGFV GAWEAEGSGP FTESSVEVPS RFGHARRIFA EIRRLPEMPD VVVCGSDHLA QGVIVEALAA GLKVPEDIAV VGFGNSSIAG EMRPTITSVE VDGARIAREA IAAIRRKADK STPPERWIDV GFRLIARESA
|
| |