Gene Saro_0612 details


Gene Information

Locus tag: Saro_0612
Symbol:
ID: 3915624
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 658339
End bp: 659421
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 64%
IMG OID: 640443342
Product: ABC transporter, periplasmic substrate-binding protein
Protein accession: YP_495893
Protein GI: 87198636
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR03427] ABC transporter periplasmic binding protein, urea carboxylase region
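
The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS of the given length, excluding one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(658339, 659421)
print(length_bp)                  # 1083
print(protein_length(length_bp))  # 360
```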


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0367247
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCACGC TGATTGGCAA GCTCGGTATG ACGGGTCTGT TCGCGGGGGC GTTGCTGATC 
GCATCCTGCT CACCCTCTGG TGCGCCCCAT GCCGAACCCC GCAAGGAGTT CAGCATAGGC
TGGTCGATCT ATGCGGGATG GATGCCCTGG CCCTATGCCC AGCAGGCCGG CATCGTGAAG
AAGTGGGGCG ACAAGTACGG CATCAGGATC AACGTCGTAC AGGTCAACGA CTACGTCGAA
TCCGTAAACC AGTACACCGC AGGCAAGTTC GACGGCGTGA CCGTCACCAA CATGGACGCG
CTGACCATTC CCGCCGCGGG CGGCAAGGAC ACGAGCGCGA TCATCGTCGG CGACTATTCC
AACGGCAATG ACGGCATCCT GCTGAAGGGC GGCAATTCGC TTGCCGACAT CAAGGGGCGC
GAAACATATC TGGTCGAGCT TTCGGTGTCC CACTACCTGC TCGCCCGCGG GCTGGAAAAG
GCGGGTCTGA AGCCGACCGA CGTCAGGACC GTGAACACCT CCGACGCCGA TCTCGTCAGT
GCGTTCAGCG CGCCTGACGT GACCGCCGCG GTCACCTGGA ACCCGCAGCT CTCGGTGATG
AAGGCTCAGC CCGGCGTCAC CCAGGTCTTC AGTTCCGCCG ACATTCCGGG CGAGATCGTC
GACCTTCTGG TGGTCGATAC CGCCACGCTC AAGGCCAATC CCGATCTCGG CAAGGCGCTG
GCAGGCATCT GGTACGAAAC CGTCGCCCTG ATGCAGCGGC AGGACGAACA GGGCAAGGCT
GCGCGGGCCG CCATGGCCAA GCTCTCGGGC TCGACCCCGC AGGCGTTCGA CAGCCAGCTC
AAGACAACGT TCCTCTATGG CGAACCCAAG GCCGCCGTCG ACGCCGCCAC CGCGCCCGCG
CTCGTGACGA CGATGACCAA GGTCCGCGAT TTCAGCTTCT CGAAGGGCTT GTTCAAGGGC
GCCGCCTCGG CCGATGCGGT CGGCATGGCC TTCCCCGGCG GCAAGACCCT TGGCGATCCG
CAGCACGTCA CCCTGCGCTT CGACGAAAGC TTCATGAAGC TGGCCGCCGA CGGCAAGCTC
TGA
 
Protein sequence
MVTLIGKLGM TGLFAGALLI ASCSPSGAPH AEPRKEFSIG WSIYAGWMPW PYAQQAGIVK 
KWGDKYGIRI NVVQVNDYVE SVNQYTAGKF DGVTVTNMDA LTIPAAGGKD TSAIIVGDYS
NGNDGILLKG GNSLADIKGR ETYLVELSVS HYLLARGLEK AGLKPTDVRT VNTSDADLVS
AFSAPDVTAA VTWNPQLSVM KAQPGVTQVF SSADIPGEIV DLLVVDTATL KANPDLGKAL
AGIWYETVAL MQRQDEQGKA ARAAMAKLSG STPQAFDSQL KTTFLYGEPK AAVDAATAPA
LVTTMTKVRD FSFSKGLFKG AASADAVGMA FPGGKTLGDP QHVTLRFDES FMKLAADGKL
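
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal Python sketch. It strips the 10-base block spacing, computes the GC fraction, and translates codons up to the first stop. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted, so the sketch uses the standard codon assignments. It does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The helper names `clean`, `gc_fraction`, and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGTCACGC TGATTGGCAA GCTCGGTATG"
print(translate(first_blocks))  # MVTLIGKLGM
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows the listed protein sequence and GC content to be checked against the record.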