Gene Information |
Locus tag | Saro_1609 |
Symbol | |
ID | 3918717 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1676725 |
End bp | 1677795 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640444349 |
Product | LacI family transcription regulator |
Protein accession | YP_496883 |
Protein GI | 87199626 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
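The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (which matches the listed values, since 1677795 - 1676725 + 1 = 1071) and that the last codon of this unspliced CDS is a stop codon not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Saro_1609.
length = gene_length_bp(1676725, 1677795)
print(length)                     # 1071
print(protein_length_aa(length))  # 356
```

Both results agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields in the record.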
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTCGCA GGCGCCAGGC GGTAACGATC AAGCACGTGG CAGCGGACGC CGGCGTCTCG CTGCAGACGG TCAGCCGCGT TATCAACAAC GAACCCAACG TGCGTCCGGA AATGCGCGAA AAGGTCCAGG CCTCGATCGA CCGGCTCGGC TACGTGCCGT CGATCGCCGC CCAGCGGATG AGCGGATCGC GTTCCTACCT CATACTCGCG CTGAACGACC GCGAGCGCAC GATTGCCGAC TGGCGGGCAC GTCAGGGCAC CGACTGGGTT GACCAGATGC TGCTCGGCGG GATGCTCGAA TGCGCAGAGC ACGGTTATCG CCTGATCTTC GAACTGGTCG ATACGCACAA CGACCATGTC GAACGCGAGC TGACCGCTGC CATCGCGGCG CTTCAGCCGG ACGGCGCGAT CCTCACGCCG CCGCACTCGG ACAATCCCAA GATCCTCGCG GTCCTTGCCC GGCACAAGGT GCCCTTTGCA AGGATCGGCG CGCAGACCAG CGAAAAGGGC CTGCCGCCGG GCATCCTCGT TTCGATGGAC GACGAAGGCG GCGCCCGCAC CGCGACGCGG CACCTGCTAG ATCTCGGACA TCGCCGCATC GGCTTCATAT CCGGACCTAC CGAATATCGC CTCGCGGGGA AGCGAGTCGA AGGCTGGCGC GCCGAAATGG AGGCGGCAGG GCTCGGTGTC GATGGCCTGC TCGAGGCTGG CGACTTCACC TACCAGTCTG GCGTGCGGGC TGCGCGCGCC CTGCTGACGA GGCCTGACCG GCCGAGCGCG ATCATCGCCA GCAACGACCA GATGGCGCTT GCCACGGTCG AGATTGCGGA CGAACTGGGC CTGTCTATCC CTGCCGACCT TTCGCTGGTC AGTTTCGACA ATACGCCGCT TGTGCGCTTC ACCCGCCCGG CGCTGACTGC CGTGGATCAG CCCATTGCCG ATACGACCGC GCGGGCGGTA AGGATGCTCA TCGCCTCACA CCGCAAGCCC GATGCCGACA TGGGCCCGGT GGTCATGCCG ATGGGTTTCG AGATCCGCGG CTCTACCGCG CCTTTCGGCA AGGGCGGCTA G
|
Protein sequence | MGRRRQAVTI KHVAADAGVS LQTVSRVINN EPNVRPEMRE KVQASIDRLG YVPSIAAQRM SGSRSYLILA LNDRERTIAD WRARQGTDWV DQMLLGGMLE CAEHGYRLIF ELVDTHNDHV ERELTAAIAA LQPDGAILTP PHSDNPKILA VLARHKVPFA RIGAQTSEKG LPPGILVSMD DEGGARTATR HLLDLGHRRI GFISGPTEYR LAGKRVEGWR AEMEAAGLGV DGLLEAGDFT YQSGVRAARA LLTRPDRPSA IIASNDQMAL ATVEIADELG LSIPADLSLV SFDNTPLVRF TRPALTAVDQ PIADTTARAV RMLIASHRKP DADMGPVVMP MGFEIRGSTA PFGKGG
|
| |
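The sequences above are displayed in space-separated blocks of 10 characters. The following is a minimal sketch of how such a display string could be joined into one contiguous sequence and how a GC percentage like the one in the GC content field could be computed. The helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence, so its result is not the whole-gene value.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence shown above.
first_blocks = "ATGGGTCGCA GGCGCCAGGC"
seq = clean_sequence(first_blocks)
print(len(seq))         # 20
print(gc_percent(seq))  # 75.0
```

Passing the full Gene sequence field through the same two functions would give the whole-gene figure to compare against the 66% listed in the record.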