Gene Saro_2508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2508 
Symbol 
ID3916829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2710971 
End bp2712032 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content67% 
IMG OID640445265 
ProductAraC family transcriptional regulator 
Protein accessionYP_497778 
Protein GI87200521 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAGCG CCGACAACAT TCTCGCTCAT GCCAGCGCCA TGCGGCAGTT TTCCCAGTTC 
GCGGGAGTGG CCGGGCTCGA CCTGCGCCGC GCCTGCCCGC CAGACGTCTA CGCATTCGTC
GAGCAGGCGC AGGATGCGGA GTGGTTGCCG GCGACGGCCC ACGTCGATGT TCTCCAGGCC
GCCGCAATCG CTTCGGGCCG GGCAGACCTC GGTGTCGCAT TCGCCATGTG GTGCAACATC
CGTGGCTTCG GTCCTGTGAG CCTGCTGTGG GACCATTGCA CCACTGTCGA CGAGGCGAGC
CGCATTACCC GGCGCTACAT GCACCTGGAG AGCGCGGCCA TGCGATCGAG CACGGATACC
GACGGGCACG AGGCTGCGCT GCGCCACATC CTGATGGTTC CGGCCCGTTT CGGCGGATCG
CAGTTCCTGC AGGCTACGCT GGCGCTGCAA CTGCGCATCA TCCGGATGCT TCTGGGCGAG
GAGTGGACGC CGATCAGGCT CGAGCTGGAT CATCCCGCGC CGCCTTCGTA TCGCTATCAC
CAGGCCGTGT TCAGATGTCC GATCGAGTTC GAGGCGGACC GGTGCGCACT GGTTTTCCGC
AAGTCGGACC TGCACCGGCC TTCGCTGCGC GGGAATGCGA ACATGGTGCA ATATCTCGAA
CGGCAACTGG CCCATGCGGA TTCGCACTGG CCCGGCGATC TCGTCCAGCA GATCCGCTAT
TTCGTCGCCG CCAACCTGAC CGAGCGCAAG GCCAACCTCG CGCATGTCTC GGGGCTCGCC
GGGCTCTCGT CGCAGAGCCT GCAACGCCGC CTGGCCGAAC GGGGAACGAC GTTCGCGACG
ATCCTCGAGG AGGTGCGCAA GCAGACGGCG GACGAGTATT TCCGTACCGC GCGCCGCCCG
AACCTGACGG AGCTTTCGCA TCGACTGGGC TATACCGACG CGAGCGCGGC AAGCCGTTTC
CTGCGCCAGC ACATGTCGAC CGGCGCCCGC GCGTTGATGG CGCAGGTAAG GCCGGGGCGC
GGTCGTCCGG GCAGTGCCCG CGCGCTAGCG GCCGAGGCTT GA
 
Protein sequence
MTSADNILAH ASAMRQFSQF AGVAGLDLRR ACPPDVYAFV EQAQDAEWLP ATAHVDVLQA 
AAIASGRADL GVAFAMWCNI RGFGPVSLLW DHCTTVDEAS RITRRYMHLE SAAMRSSTDT
DGHEAALRHI LMVPARFGGS QFLQATLALQ LRIIRMLLGE EWTPIRLELD HPAPPSYRYH
QAVFRCPIEF EADRCALVFR KSDLHRPSLR GNANMVQYLE RQLAHADSHW PGDLVQQIRY
FVAANLTERK ANLAHVSGLA GLSSQSLQRR LAERGTTFAT ILEEVRKQTA DEYFRTARRP
NLTELSHRLG YTDASAASRF LRQHMSTGAR ALMAQVRPGR GRPGSARALA AEA