Gene Saro_2600 details


Gene Information

Locus tag: Saro_2600
Symbol:
ID: 3917015
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2808811
End bp: 2809866
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 66%
IMG OID: 640445359
Product: AraC family transcriptional regulator
Protein accession: YP_497870
Protein GI: 87200613
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
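
The start, end, gene length, and protein length fields listed above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted as a residue; the record does not state either convention, but both reproduce the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2808811
end_bp = 2809866
gene_length_bp = 1056
protein_length_aa = 351

# Assumption: start and end are 1-based and both inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# as an amino acid residue.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # 1056 352 351
```

Under these assumptions the computed span matches the listed gene length, and the residue count matches the listed protein length.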


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.867319
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGCGC CCACTATCTC CGCGCCGTTC CTGCGCCACG TGGCCAATTG CGTCGAGCTT 
ACGGGCCGTA GCGCCGCGCC GCTGCTGGAG GAACTGGGCA TCGCGCAGGA ACGGCTCGAC
GATCCCGAAG GCCTCATTCC CCTGGCCGCC TTTCTCGCCT TTTTCGAGAG CGCGGCCACC
CTCGTGCGCA ACCCCCACTT CGGGCTTCAT GCCGGGCGGC TGGCCGGTTC GGACAGCCTC
GGGCCGTTGA GCTTCCTGTT CCTGAGCGCG CCCGACCTGG GTGCGGCCTT CACCAGCTTC
ACACGCTATC TCGCGCTGAT GCAGCAGGCT TCGCGCAACA CCTTCACCAT CGGCGATCGT
TGGGCCACGT TCGAATACAT GGTGCAGGAT CAGCGCCTTA CCGCCCGGCG GCAGGACGCC
GAATACTCGA TCGGCGCAAT GTTCAGCCTT GCCCGGCAAT TCACCGGCGG CACCATCGAA
TTTCGCGAAG TGCGGTTCGA GCACGAGCGC GTCGGCGACT ATGCCCGCTA CGCCGACTTC
TTCGGCTGCG ACGTCTTCTT CGAACAGGAA ACCAACGCCC TGTCCTTCGA CCGGGGTTGC
CTCGAAATTC GCGGCAAGGT GCTCAGTCCG TCGCTGCATC CGATCATCGA GGACCACTTG
CGCCGCCGCG AATCTCCGGC GGCAGCGGCG ATGGCGAGCT TTGCGGATCG CGTGCGCACC
ATCGTTGCCG CCACGCCACT CGACCGGCAC CTGCCGGCAA GCGACGCGGC AAGGCGGCTG
GGATGCTCGT TGCAGACATT CCACCGGCGG TTGGCGCAGG AAGGCGCGAA CTGGCGGACG
CTTGTCGCGG AACACCGCAT GGAAGCAGCC GCGCGCCTGC TACGCGACAG CCGACGCGAG
ATCAGCGCGA TTGCCCTGGC GCTCGGCTAT TCGGAAAGCG CCGCTTTTGT CCGCAGTTTC
AGCCGCCATT TCGGCCAATC GCCGGGACGC TACCGCCGGC ACCTGCAAAA TGGCGCCGCC
CTCCCGGTGG GAGAGGACGG CGCCAGTTCA GGGTGA
 
Protein sequence
MTAPTISAPF LRHVANCVEL TGRSAAPLLE ELGIAQERLD DPEGLIPLAA FLAFFESAAT 
LVRNPHFGLH AGRLAGSDSL GPLSFLFLSA PDLGAAFTSF TRYLALMQQA SRNTFTIGDR
WATFEYMVQD QRLTARRQDA EYSIGAMFSL ARQFTGGTIE FREVRFEHER VGDYARYADF
FGCDVFFEQE TNALSFDRGC LEIRGKVLSP SLHPIIEDHL RRRESPAAAA MASFADRVRT
IVAATPLDRH LPASDAARRL GCSLQTFHRR LAQEGANWRT LVAEHRMEAA ARLLRDSRRE
ISAIALALGY SESAAFVRSF SRHFGQSPGR YRRHLQNGAA LPVGEDGASS G
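
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the Python below strips that grouping, computes a GC fraction, and translates a nucleotide sequence using the sense-codon assignments of NCBI translation table 11 (which are the same as the standard code). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database tool. The sketch also assumes the sequence starts in frame with ATG, so the alternative start codons of table 11 are not handled.

```python
from itertools import product

# Sense-codon assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(text):
    """Remove the spaces and line breaks used to group the printed sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(text)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(text):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First printed line of the gene sequence (60 bases, 20 codons).
first_line = "ATGACCGCGC CCACTATCTC CGCGCCGTTC CTGCGCCACG TGGCCAATTG CGTCGAGCTT"
print(translate(first_line))  # MTAPTISAPFLRHVANCVEL
```

The translated prefix matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the listed GC content and complete protein.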