Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2600 |
Symbol | |
ID | 3917015 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2808811 |
End bp | 2809866 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640445359 |
Product | AraC family transcriptional regulator |
Protein accession | YP_497870 |
Protein GI | 87200613 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.867319 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCGC CCACTATCTC CGCGCCGTTC CTGCGCCACG TGGCCAATTG CGTCGAGCTT ACGGGCCGTA GCGCCGCGCC GCTGCTGGAG GAACTGGGCA TCGCGCAGGA ACGGCTCGAC GATCCCGAAG GCCTCATTCC CCTGGCCGCC TTTCTCGCCT TTTTCGAGAG CGCGGCCACC CTCGTGCGCA ACCCCCACTT CGGGCTTCAT GCCGGGCGGC TGGCCGGTTC GGACAGCCTC GGGCCGTTGA GCTTCCTGTT CCTGAGCGCG CCCGACCTGG GTGCGGCCTT CACCAGCTTC ACACGCTATC TCGCGCTGAT GCAGCAGGCT TCGCGCAACA CCTTCACCAT CGGCGATCGT TGGGCCACGT TCGAATACAT GGTGCAGGAT CAGCGCCTTA CCGCCCGGCG GCAGGACGCC GAATACTCGA TCGGCGCAAT GTTCAGCCTT GCCCGGCAAT TCACCGGCGG CACCATCGAA TTTCGCGAAG TGCGGTTCGA GCACGAGCGC GTCGGCGACT ATGCCCGCTA CGCCGACTTC TTCGGCTGCG ACGTCTTCTT CGAACAGGAA ACCAACGCCC TGTCCTTCGA CCGGGGTTGC CTCGAAATTC GCGGCAAGGT GCTCAGTCCG TCGCTGCATC CGATCATCGA GGACCACTTG CGCCGCCGCG AATCTCCGGC GGCAGCGGCG ATGGCGAGCT TTGCGGATCG CGTGCGCACC ATCGTTGCCG CCACGCCACT CGACCGGCAC CTGCCGGCAA GCGACGCGGC AAGGCGGCTG GGATGCTCGT TGCAGACATT CCACCGGCGG TTGGCGCAGG AAGGCGCGAA CTGGCGGACG CTTGTCGCGG AACACCGCAT GGAAGCAGCC GCGCGCCTGC TACGCGACAG CCGACGCGAG ATCAGCGCGA TTGCCCTGGC GCTCGGCTAT TCGGAAAGCG CCGCTTTTGT CCGCAGTTTC AGCCGCCATT TCGGCCAATC GCCGGGACGC TACCGCCGGC ACCTGCAAAA TGGCGCCGCC CTCCCGGTGG GAGAGGACGG CGCCAGTTCA GGGTGA
|
Protein sequence | MTAPTISAPF LRHVANCVEL TGRSAAPLLE ELGIAQERLD DPEGLIPLAA FLAFFESAAT LVRNPHFGLH AGRLAGSDSL GPLSFLFLSA PDLGAAFTSF TRYLALMQQA SRNTFTIGDR WATFEYMVQD QRLTARRQDA EYSIGAMFSL ARQFTGGTIE FREVRFEHER VGDYARYADF FGCDVFFEQE TNALSFDRGC LEIRGKVLSP SLHPIIEDHL RRRESPAAAA MASFADRVRT IVAATPLDRH LPASDAARRL GCSLQTFHRR LAQEGANWRT LVAEHRMEAA ARLLRDSRRE ISAIALALGY SESAAFVRSF SRHFGQSPGR YRRHLQNGAA LPVGEDGASS G
|
| |