Gene Saro_3401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3401 
Symbol 
ID5077982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp52 
End bp1395 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content67% 
IMG OID640481125 
Productmajor facilitator transporter 
Protein accessionYP_001165787 
Protein GI146275627 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.189557 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACCAGG GTTCCACCGG CGGCATCGGC CCAGGCAGCT ACCGCGCGCC CGAGAACGCG 
GCCTCCGAGC GTATCGAATC GAGCGTCGGC GGAAAGGTTG TCCGCCACCT CGTCATCCCC
GCCGCGCTCT ATATCCTGAT CGGCGCGATC GATCGCACCA ACGTCGGATT TGCCGCGCTC
GAAATGAACA AGGCACTCGG ACTTTCGGGA ACGCAGTATG GCTTCGGCGC GGGGGTCCTC
TTCGTCGGCT ACATGATCGC CAAGTACCCA AGTGTCCTGC TCTACGAAGC CATCGGCCTG
CGCCGTTGGC TGACCTTGAT CACCGCCGCG TGGGGCCTGT GCTCGTGCCT GATGGCGGCC
GTCGCCAACG AATGGCAGCT ATACGCCCTG CGGGTTCTGA TCGGGTTTTC GGAAGGCGGG
CTGTCGTCGG GCCTGATGCT CTATCTCAGC CTCTGGGCGC CAGAACGCTT CCGCGCGACC
ATTCTCGCGA TCCCCATTGC GTCCATCTCC ATCGCCCAGG TCGTCGGCGC GCCAATTTCA
GGCTTGCTCC TTGATCTCGA CCGGCCGTTG GGGCTCGAAA GCTGGCGCTT CATGTTCCTG
GTCGAGGCCC TGCCTGCCCT CGCCCTCGCA GCATTCGCCT GGCTCCACTT TCCCGATACG
CCCGCCGATG CCCGCTGGCT TACGGCGACC GAGCGCGACT GGATCGCCGC GAACGTGAAG
GGCGCCCGCA AGCCGGCGCC GGGCCAGGGT GCGGAGCGCT GGGCGGTCCT GCGCAGCCCG
GAAGGCTGGC TCTGTGCCGC CATCTGGTTC TGCATCCTCG CTTCGAACTA CGGCATCATG
TTCTGGTTGC CGCAGGTCGT GAAAAGCCTT TCAGGGCTCA GCTCCGCCAT GACCGGGGTC
ATAGTCGCGC TGCCGTGGGC GGCCAGCGGG ATCGGCCTGG TGCTGAACGC GCGCCATTCG
GACCGGACGG GCGAGCGCTA CCTCCATGTC GCCATCCCGG CCGTCGTCGG CGGCGCGGGC
CTGCTGCTTG CCTACATCTT CGGTGCCGGA TTGCCGGGCC TTGTCGCGCT GGTCATCGGC
GGTGCCTGCA CGGGCTGCAC CGTCGCGGCT TTCTGGGCTA TTCCCACCCG TCTGCTCTCG
CCGGGGGCGC TGGCCATGGG CATCGTCGCG ATCAACATGA CCGGCAGCCT TGCCGGCGCT
ACCGTTCCGC CGCTCATGGG CTATCTTCGC GAAACCACTG GCTCGTTCCT CCCGCCCGCG
CTGCTGTTGT GCGGCATCGC CGTCGTATGT GCCTGCCTGA CGCTCGTCGC GCGCCGCGTC
GCCGCGCGGG CCGAAGCGCA GTGA
 
Protein sequence
MNQGSTGGIG PGSYRAPENA ASERIESSVG GKVVRHLVIP AALYILIGAI DRTNVGFAAL 
EMNKALGLSG TQYGFGAGVL FVGYMIAKYP SVLLYEAIGL RRWLTLITAA WGLCSCLMAA
VANEWQLYAL RVLIGFSEGG LSSGLMLYLS LWAPERFRAT ILAIPIASIS IAQVVGAPIS
GLLLDLDRPL GLESWRFMFL VEALPALALA AFAWLHFPDT PADARWLTAT ERDWIAANVK
GARKPAPGQG AERWAVLRSP EGWLCAAIWF CILASNYGIM FWLPQVVKSL SGLSSAMTGV
IVALPWAASG IGLVLNARHS DRTGERYLHV AIPAVVGGAG LLLAYIFGAG LPGLVALVIG
GACTGCTVAA FWAIPTRLLS PGALAMGIVA INMTGSLAGA TVPPLMGYLR ETTGSFLPPA
LLLCGIAVVC ACLTLVARRV AARAEAQ