Gene Saro_3926 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3926 
Symbol 
ID5077410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009426 
Strand
Start bp97655 
End bp99322 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content65% 
IMG OID640481033 
Producthypothetical protein 
Protein accessionYP_001165695 
Protein GI146275534 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACGC TGACCAAATT GCGCGCCCGG GTTTCACCGC AAGCGATCAC CAAGGTGACC 
CGTATCTTCA ACGGGACCCT CGACGACATC TTCGCCGAAC TCTTCCAGAA CGCCCGGCGC
GCCGGCGCCA CACGCGTCGC GGTGACGGCC GAGCACCTGG ACGATGCCTG CCTGATCACC
GTCGATGACG ACGGCGCAGG TATTGCTGAT CCCATCGATC TCGTCTCGCT CGGCCAATCC
GATTGGCCCG GTGAATGCCG GTCACGTGAG GACCCTGCCG GCATGGGCTT CTTCAGCCTC
GCCGGGCTTG ACACGGTGGT CAGTTCCACC AGTGCTGCTG GCTCGTTCTG GCTGGCGATC
GCAGGTGATG CCTGGACCGG CGAGGCCGAC ATTGATGTGC TGCCGTGGGA TGGTCCGCGC
GGCACGGCAA TCGCGTTCCA CTTTCCGCCC GGACCGGATG GCAAGCTCGA GCGGACCGTT
GAGGCAGCGG CGCGGTTCTT CCCCCTGCCG GTCACTTACA ACGGCAAGGA TATCGCCCGC
GCCGACTTCC TCGCCGATGC CTACAAGATC ATCGAGCGCG ATGGCTTCCG GATCGGGGTG
TTCCGCGACC GGCATTCGCC GCACGTCGCG ACGCTCAATT TCCACGGGGT GACGCTCAAG
CACGCGTTCC CGGTGATCAA GGAAGTCCAC CACACCCAGT GGAGCGTCCA GGTCGATATC
GTCGATGCCC CGGACCTTGT TCTCGTCCTG CCCGCCCGCA AGGAGATCTA CCGCAATACC
GCGCTCGACC ATCTCGTCGC GCTGTGCCGC CGTGCGATCT TCTCGGTCAT CTACGCCGAG
CCGCTGCACA GGCTGAGTTT TGAGGACTGG CTCGAGGCAC GGTTCTACTC CGACGATTTC
CCTCAAGCGG CGCGGCAACT GCCGCTCTGG TCGCCGTCCA CCGCCCGCGA AGACTATCGC
CAGGTCCCGG CCTTCGCCGA TCTCGAGCCG GGCGCGACCA TCTACGATGA CACGGACTCC
TATGATTCCG TGACGTTCGG CCGCGCGCTC CGGCGCTCGA ATGGCGGGGA ACCACACAAG
CTCGGCGGAC CCGACCCCCG TGCGTTCCAT GAGCCGATCA CCAATTTCAT CGGCTATCCG
TGGTATGACG CCCTGTCCTG CTTCATCCGC ACTGGCGAGT GCCTCACGCA CGACGGCGAT
CAGGCGGCCG CGAGTGAGCC CGATGCCCTG ACCCAGAGGC CGGACGCTAT CCGGATCGAA
CTGACCGATC AGCACGGCAA CCGCCTCGAC GTCGAGACCG ACTTCGTCAT CCAGGAGGGC
GACGATTCCT GGGGCGATCC CGACTGCGCG GTGATTGCAG TTACCCGGGG CTCGGAACTC
GATCCGAACG ATCTCACCGA CCTCATCATC GATGCGGTGT TCTCGCCTTC GGACGATTCC
GATGCCGACA GCTACGACAC CCAGGAGACC CGCTTTCGCC ACGACGCTGC CGTGCGGGCC
CATGCCATCC TTGAAGGCGA TGACGCCGCC ATTCTCGCTG GCATCCGCAT GGCCTTCGCC
GACCGTGTTG CCTGGCGCAT CCCGCATGGC CGCAAGCTGC AGCTGACCTG GTCGAGCAGC
GGCAATGACC TCACCCTTGT CGTCGCGGGG GAGGGCGCCA ACCAATGA
 
Protein sequence
MTTLTKLRAR VSPQAITKVT RIFNGTLDDI FAELFQNARR AGATRVAVTA EHLDDACLIT 
VDDDGAGIAD PIDLVSLGQS DWPGECRSRE DPAGMGFFSL AGLDTVVSST SAAGSFWLAI
AGDAWTGEAD IDVLPWDGPR GTAIAFHFPP GPDGKLERTV EAAARFFPLP VTYNGKDIAR
ADFLADAYKI IERDGFRIGV FRDRHSPHVA TLNFHGVTLK HAFPVIKEVH HTQWSVQVDI
VDAPDLVLVL PARKEIYRNT ALDHLVALCR RAIFSVIYAE PLHRLSFEDW LEARFYSDDF
PQAARQLPLW SPSTAREDYR QVPAFADLEP GATIYDDTDS YDSVTFGRAL RRSNGGEPHK
LGGPDPRAFH EPITNFIGYP WYDALSCFIR TGECLTHDGD QAAASEPDAL TQRPDAIRIE
LTDQHGNRLD VETDFVIQEG DDSWGDPDCA VIAVTRGSEL DPNDLTDLII DAVFSPSDDS
DADSYDTQET RFRHDAAVRA HAILEGDDAA ILAGIRMAFA DRVAWRIPHG RKLQLTWSSS
GNDLTLVVAG EGANQ