Gene Saro_3888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3888 
Symbol 
ID5077372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009426 
Strand
Start bp56617 
End bp57801 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content59% 
IMG OID640480995 
Productmajor facilitator transporter 
Protein accessionYP_001165657 
Protein GI146275496 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0648184 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGCGA TGATCCAGGC ATTGCAGTCA CTTGGGTTTA CAACTTTGCT TCCGGCCTTG 
GGTGTCATGG CAAACGATCT GGGGGCAAGC GGTTCAAACC AGCGGCAATG GGTCATCGGC
ACCTTCCTGA TCTGCTCCGG CCTGTTTTCG CTCGTCCCGG GGACGATCTC CGACCGGCTT
GGGAGGAAGC CGGTCCTGCT GGTTTGCATG GGCCTGTTCG CCCTGATCAA TCTGCTTTGT
GCTTTCGTGG CTGACTTTTC CGTCCTGCTG GCGGCCCGCG CGCTGCTTGG CTGTGCCTCG
TCCGCCCTGA CCGTCCTTCC TCTGGCGATC ATTCGCGACC GCTATCAAGG CGAAGACATG
GCCAAGCTGC AAGCCTTTGT CGCGATGCTG TTTATGGCAG TCCCGACCCT TGCCCCAAGT
TTGGGTTATG TGATCTTTAC TACGCTCGGC TGGCGTGCCG TCTTCATTGT CATCGGGGTG
CTGTCGATGG GAGTGTCAGC ATGGTATTTC TACCGCATGG AAGAAACGCT GCCGCTTTCC
CGGCGGCAGT CCCATGGTGC CAGTGAATTG TTCGGCAACA TCCGCCTCGT TCTGACAAAC
CGCCGGTCGA TCGGTTACGT CATTGGCATG GCCCTGATTT TCGGCGCTCA CTTCGGCTTT
ATCAACAGCT CGCAGCAATT GATCGGCGAA CATTTCGGGG CCGGCGGGGC CTATTCGGTA
ATCTTTGGCT TGATGGCGGG CTCGATGATG CTGGCCAGCA TTGCCAATTC TGCGATCGTA
CACCGTTTCG GCATCCGGGG GATCGGCCAT GCCGCCATCC TGTGCCATTT CCTTGTGTCG
ATCGGCCAGA TCTATTTGGC ATCCCGACCG GGCGAAACCC TGCTTCAGTT TGTGCTTCTG
ACATCCGCGA ACATGTGCTT GCTGATCACG GTCTTCATCA ATTTCACCGC GATTTCCCTG
CAGCCATTCG GCAAGGTGGC CGGGGCCGCG GCCTCTGTTC AGACGTTCTT CCGTTTGGTG
CTTGGCGCGG GGCTGGGCGC ACTGATCGGG CAGGCCTACA ATGGTTCCCC GCTGCCTCTG
GCCTATTCGT TCTTCGGGGT GGCGAGCGTG ACCATCGTGC TTGTACTGTT CAGCGAAGGC
TGGCGATTGT TCGGGGGACC TATACCGGTT CAGGCTGCGA ACTGA
 
Protein sequence
MMAMIQALQS LGFTTLLPAL GVMANDLGAS GSNQRQWVIG TFLICSGLFS LVPGTISDRL 
GRKPVLLVCM GLFALINLLC AFVADFSVLL AARALLGCAS SALTVLPLAI IRDRYQGEDM
AKLQAFVAML FMAVPTLAPS LGYVIFTTLG WRAVFIVIGV LSMGVSAWYF YRMEETLPLS
RRQSHGASEL FGNIRLVLTN RRSIGYVIGM ALIFGAHFGF INSSQQLIGE HFGAGGAYSV
IFGLMAGSMM LASIANSAIV HRFGIRGIGH AAILCHFLVS IGQIYLASRP GETLLQFVLL
TSANMCLLIT VFINFTAISL QPFGKVAGAA ASVQTFFRLV LGAGLGALIG QAYNGSPLPL
AYSFFGVASV TIVLVLFSEG WRLFGGPIPV QAAN