Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3812 |
Symbol | |
ID | 5077960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 467518 |
End bp | 468504 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640481535 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001166197 |
Protein GI | 146276037 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.254073 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGCAT CACTTGCCGA AGCCAATGTG CCGGGGGTCA ACAGCCTCGT CTCGCAGGAT GCCGAAGTCG TCCACAATGC ATTGAGCCGG CAGCTTGCGC GCCACAGCTT CGAATGCAGC CACACACGCA AGCTGGATGC CCACATCCAC AGCGCCGAGA TCGGGTCGAT CCAGATTGTC GACCTGCAAT ATGGTGCCGA CGTCGCAGTT TCGGCCGAGC TTGGCGATTC CCATGTCCTC GTCCACCTGG CGCTGGATGG CGAGACGACG ATGTGGGCCA ACCACGGAAA GGTCGTTCTC CGGCGGGACG AAATGCTGGT GTCATCGCCC GGCACTCCCC TGCGCGTGGA AATGACGCCC TCCTGCCGAC ATCTTGCAGT ACGGCTGCCG GTTGCCACCT GCACCGAATA CCTTGCACGA GAGCTCCACA TCCCGGTGAG CCGCCCGCTC GAGTTCTATT CCGGCAACCA GGGCGCGCGG GAACTGCCGC TGGTATGGCG AGGCATGGTC CAGCACCTTG GCGAACAGTT GCGCCTCGGC CCGACGATCA TGGCCAGTCG CCGGCTCAAG CGGCAGTACG AGATGGTGCT GGCCGAGATC TTGCTGGGCA ACTACTGCAA CAGCTATTCC GAACAGATCG CGCTGCACGG AAACGACATT TCGCCACGCC ACGTGCGCCG CGCCAGAGAG ATAATCCATC AGTCGCTAGA CGACAACGTG TCGATCAATG CGCTGGCTGC GCAAGTCGGG GTTTCGGTGC GCTCGCTGCA GAACGGGTTC CGCGACTTCC TCGGCGTCAC GCCACTGGAA TACGTTCGCC GCCACCGGCT TGAACGCCTG CACTCTGCCT TGATGAGCGC GGCCGGCGAT GCCAGCGTGA CCGAGCTGAT GCTCGAATGC GGCATCGTCA ATTTCGGGCG CTATGCCCAG TATTACCGCC AGCAGTACGG CTGCCTGCCT TCCGAGACGC TCCGCCGCCG GGTCTAG
|
Protein sequence | MLASLAEANV PGVNSLVSQD AEVVHNALSR QLARHSFECS HTRKLDAHIH SAEIGSIQIV DLQYGADVAV SAELGDSHVL VHLALDGETT MWANHGKVVL RRDEMLVSSP GTPLRVEMTP SCRHLAVRLP VATCTEYLAR ELHIPVSRPL EFYSGNQGAR ELPLVWRGMV QHLGEQLRLG PTIMASRRLK RQYEMVLAEI LLGNYCNSYS EQIALHGNDI SPRHVRRARE IIHQSLDDNV SINALAAQVG VSVRSLQNGF RDFLGVTPLE YVRRHRLERL HSALMSAAGD ASVTELMLEC GIVNFGRYAQ YYRQQYGCLP SETLRRRV
|
| |