Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1887 |
Symbol | |
ID | 3917108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 1993798 |
End bp | 1994832 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640444631 |
Product | LacI family transcription regulator |
Protein accession | YP_497161 |
Protein GI | 87199904 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.587669 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGCGTA AGCCATCCAA CAAGCCGACG AGTTTCGACA TTGCCTACCT CGCCGGCGTG TCCCAACCTA CCGTCAGCCG TGCGCTCAGG GGCAGCAAGT CGGTCAGCCT CGCCACGCGC CAGAAGATCG AGGCGATCGC GCGCCAGCTC AACTATACGG TCGACAAGAA CGCTTCGTCA TTGCGCTCGC AGCGATCGAA CACCCTCGCC CTGCTGTTCT TCGAGGACCC GACGCCGGAC GAATCGAACA TCAACCCGTT CTTCCTCGCC ATGCTCGGCT CGATCACCCG GCACTGCGCC AATCGCGGCC TCGACCTGCT GATCTCGTTC CAGAAGCTCG ATGACGACTG GCACAAGCGC TACCAGGACA GTCATCGCGC CGACGGGCTG ATCCTGCTCG GCTACGGTGA CTACACTCTC TACGGATCCC GCCTGCGCCA GCTCATCCGC TCGGGCACGC ATTTCGTGCG CTGGGGCTCG GTAGACGAAG GCACCATCGG GGCGACAATC GGGTCCGACA ACTTCGGCGC CGGACGCCTG GCGGGCGAGC ACCTCCTTGC CCGGGGCCGC AAGCGCATTG CCTTCCTCGG CCAGGCGGAT TCGCACTATC CAGAGTTCGA GCAGCGCTAC GCAGGCCTGT CCAAGGCCAT CCGCACAGCC GGGCTGGAGC CCGATCCGGA CCTTGTCGTC GATGCGACCT CGTCCGAGGA AATCGGCTAC AACGCCGCGC GGGAGCTGCT GTCGCGCGGC AAAACCTTCG ATGCCATCTT CGCCGCGAGC GACCTGATCG CCATCGGCGC GATGCGCGCG CTTGCCGAAG CCGGTCGTTC CGTGCCCGCC GATGTCGCGG TCGTCGGTTT CGACGACATC CCGGCCGCCA GCCTGACCAC GCCGCCATTG ACCACCATCA TGCAGGATAC GCGGCTTGCC GGTGAGGCTC TGGTCGATTG CGTGCTCGGG CAGGTCGAAG GCCGCCCACC CAGCCCGCGC ATCCTCCCCG CACGACTTGT CGTCAGGGCC AGCAGCGGCG GCTGA
|
Protein sequence | MGRKPSNKPT SFDIAYLAGV SQPTVSRALR GSKSVSLATR QKIEAIARQL NYTVDKNASS LRSQRSNTLA LLFFEDPTPD ESNINPFFLA MLGSITRHCA NRGLDLLISF QKLDDDWHKR YQDSHRADGL ILLGYGDYTL YGSRLRQLIR SGTHFVRWGS VDEGTIGATI GSDNFGAGRL AGEHLLARGR KRIAFLGQAD SHYPEFEQRY AGLSKAIRTA GLEPDPDLVV DATSSEEIGY NAARELLSRG KTFDAIFAAS DLIAIGAMRA LAEAGRSVPA DVAVVGFDDI PAASLTTPPL TTIMQDTRLA GEALVDCVLG QVEGRPPSPR ILPARLVVRA SSGG
|
| |