Gene Saro_3166 details


Gene Information

Locus tag: Saro_3166
Symbol:
ID: 3918208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3380417
End bp: 3381637
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 65%
IMG OID: 640445950
Product: nucleoside:H+ symporter
Protein accession: YP_498435
Protein GI: 87201178
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID: [TIGR00889] nucleoside transporter
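
The coordinate and length fields above are consistent with one another. A minimal sketch of the arithmetic, assuming the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length (both are assumptions, not statements made by the record):

```python
# Values copied from the record above.
start_bp = 3380417
end_bp = 3381637

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: one trailing stop codon is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1221 406
```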


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.306334
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTCAACGC TGTCACTCAA AATTCGCCTG TTCGCCATGA TGGCACTGCA ACTCGCCGTG 
TGGGGGGCCT GGGCGCCCAA GCTGTTTCCC TACATGGCCA TGCTGGGCTT CAGCGCCGGG
CAGCAAGGGC TGGTCGGAAC GTGCTGGGGC ATCGCCTCGG TAGTGGGGAT TTTCTTCTCC
AACCAGTTCG CCGATCGCAC GTTTGCGGCC GAGCGATTCC TTGCCGGCAG CCACCTCATC
GGCGGCCTCG CCCTCCTCGG CGTCGCCTTC AGCCAGACCT TCGCGCCGTT CTTCCTGTGT
TACCTGATCT ACAGCCTGGT CTACGTGCCG ACACTGTCGG TGACCAATAC CGTCGCCTTC
ACCCACCTGC CCAATCCGGC CGAATTCGGC TCGGTCCGCT CGGGCGGCAC GGTAGGCTGG
ATCATGGCGA GCTGGCCCTT CGTCTTCCTG CTGGGCGCCC ATGCCGATGC CAGCCAGGTG
CGCTCGATCT TCATCGTCGC GGCGGTCATC TCGTTCGTCA TGGCCGCCTT CTCGCTGACC
CTGCCGCACA CCCCGCCCAA GACCGGCGAG GGCATCGACA AGCTCGCCTG GCGACGCGCG
CTGGGCCTGC TTCGGAAGCC GGTCGTGGCG GTGCTGTTCC TCGTGACGTT CATCGATTCG
GTCGTCCACA ATGGCTACTT CGTGCTGTCC GACGCCTTCC TGACGAACCG CGTGGGCATT
GCCGGCAACC TATCGATGGT GGTGCTGAGC CTTGGGCAGG TAGCCGAGAT CCTCACCATG
CTCGTGCTCG GAACCGTGCT GGCCCGGCTC GGGTGGCGCT GGACGATGAT CGTCGGCATC
CTCGGACATG CCGCGCGCTT CCTCGCCTTC TCGTTCCTTG CCGACAGCGT TCCCGCGATC
ATCGCGGTCC AGTTGCTCCA CGGCATCTGC TACGCCTTCT TCTTCGCGAC GGTCTATATC
TTCGTTGACG AGGCCTTTCC CAAGGACGTG CGCTCTTCGG GCCAGGGGCT GTTCAACCTG
CTGATCCTGG GCCTGGGCAA CATGGTGGCC AGCTTTGCCT TCCCGGCCCT CGTCTCCCGT
CTCACCGGGG CCGACGGCAA GGTCGATTAC CAGGCCGTTT TCCTTGTCCC CGCAGGGCTT
GCGGCCCTCG GCGCGGTGCT GCTGCTGGTC GCCTTCCGCC CCCAAACGCA TGGCCCCGCG
AGGCCCGAGG AGATTGCCTG A
 
Protein sequence
MSTLSLKIRL FAMMALQLAV WGAWAPKLFP YMAMLGFSAG QQGLVGTCWG IASVVGIFFS 
NQFADRTFAA ERFLAGSHLI GGLALLGVAF SQTFAPFFLC YLIYSLVYVP TLSVTNTVAF
THLPNPAEFG SVRSGGTVGW IMASWPFVFL LGAHADASQV RSIFIVAAVI SFVMAAFSLT
LPHTPPKTGE GIDKLAWRRA LGLLRKPVVA VLFLVTFIDS VVHNGYFVLS DAFLTNRVGI
AGNLSMVVLS LGQVAEILTM LVLGTVLARL GWRWTMIVGI LGHAARFLAF SFLADSVPAI
IAVQLLHGIC YAFFFATVYI FVDEAFPKDV RSSGQGLFNL LILGLGNMVA SFAFPALVSR
LTGADGKVDY QAVFLVPAGL AALGAVLLLV AFRPQTHGPA RPEEIA
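
The gene sequence begins with TTG, yet the protein begins with M. Translation table 11 (the bacterial, archaeal and plant plastid code) allows TTG as an initiation codon, and an initiation codon is read as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence; it also includes a helper for the GC fraction. The function names `translate` and `gc_fraction` are hypothetical, and `START_CODONS` lists only a subset of the start codons that table 11 permits.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in the order TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only a subset of the table 11 start codons is listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean(dna)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("TTGTCAACGC TGTCACTCAA AATTCGCCTG"))  # MSTLSLKIRL
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content reported in the record.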