Gene Saro_1295 details

Gene Information

Locus tag: Saro_1295
Symbol:
ID: 3917927
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1339901
End bp: 1341172
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 69%
IMG OID: 640444032
Product: general substrate transporter
Protein accession: YP_496573
Protein GI: 87199316
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
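
The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1339901
END_BP = 1341172


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1272, matching "Gene Length"
print(protein_length(length))  # 423, matching "Protein Length"
```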


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.659237
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCCGA TTCGCCAGCA CGGCCGCGTG CTGACCGCCA GCCTCGTCGG CACGGCGGTC 
GAGTTCTACG ATTTCTATAT CTACGCGACG GCGGCGGCGC TGGTCATCGG TCCGCTTTTC
TTTCCCACGG AATCGCAGGC TGCCCAGACT CTCCTCTCCT TCATGACCTT CGGCCTGGCG
TTCTTCGCGC GGCCCGTCGG GGCGGTGGCC TTCGGGCATT TCGGTGATCG CATCGGGCGC
AAGTCGACGC TGGTCGCGTC GCTCATGCTG ATGGGCGGAT CGACGCTGCT GATCGCGTTC
CTGCCAACTT ATGCGATCGC CGGGTGGATC GCCCCGGCGC TGCTTTGCCT CCTGCGCTTC
GGTCAGGGTT TCGGGCTTGG CGGGGAGTGG GGCGGGGCGG CGCTGCTGGC GGTGGAGAAT
GCGCCGCGCG GCTGGGAAGC GCGCTTCGGC AGCGCGCCGC AGCTTGGCGC GCCGTTAGGG
TTCTTTTTCG CCAACGGCCT GTTCCTCCTG CTCGGCCTCG GCCTCTCGGA CGCGGACTTT
GCCGCCTGGG GCTGGCGCGT GCCGTTCCTT GCCAGCGCGG TGCTGGTCGG CGTGGGCTTG
TGGGTGCGGC TGAAGATTGG CGAAACGCCC TCGTTCCGTG AAGCTATGGA AAAGGCGCCG
CCGCCGCAGG TGCCCATTTC CCGCCTGCTG CGCGGTCACT CCCCGGCCGT CCTCGCCGGG
ATCGCCGGCG TGGTCGCGTG CTTCGCCATC TTCTACCTCG CGACGACCTT CGCCCTGTCC
TTTGCCACGA CCGCGCTGGG CTATGCCAAG CAGGAATTCC TTGCGGTCCA GCTTGGCGCC
AACACCTTCT TCGCGCTCGG CATTCTCGTG GCGGGCTACT GGGCCGACAA GACTTCGGTC
CGGCGTGTCC TCGGCACGGG CGCGGCGCTC ACCGCGGTGC TCGGCATGGT CTTCGGGACC
GGGCTGGGCT CGGGCTCGCT TGCGGTGGTC TTCGCGACGC TCGCCTCGTC GCTGTTCATC
ATGGGCCTTG CCTACGGGCC GCTCGGCGCG TGGTTGCCGA CGCTCTTTCC GGTGTCGGTG
CGCTACACCG GCATTTCACT GGCGTTCAAC GTCGGCGGGA TCATCGGGGG CGCGCTGGCG
CCCTTCGCCG CGGCCTGGCT CGCGGGAGTG GGGGGCACCG CCTATGTCGG CGTGTTCCTG
ACCCTGGCTG GGGCGATGAC GCTGGCGGGC GTACAGTTCT CCCCCCGGGG GGCCGAACCC
GGGGAGGCGT GA
 
Protein sequence
MNPIRQHGRV LTASLVGTAV EFYDFYIYAT AAALVIGPLF FPTESQAAQT LLSFMTFGLA 
FFARPVGAVA FGHFGDRIGR KSTLVASLML MGGSTLLIAF LPTYAIAGWI APALLCLLRF
GQGFGLGGEW GGAALLAVEN APRGWEARFG SAPQLGAPLG FFFANGLFLL LGLGLSDADF
AAWGWRVPFL ASAVLVGVGL WVRLKIGETP SFREAMEKAP PPQVPISRLL RGHSPAVLAG
IAGVVACFAI FYLATTFALS FATTALGYAK QEFLAVQLGA NTFFALGILV AGYWADKTSV
RRVLGTGAAL TAVLGMVFGT GLGSGSLAVV FATLASSLFI MGLAYGPLGA WLPTLFPVSV
RYTGISLAFN VGGIIGGALA PFAAAWLAGV GGTAYVGVFL TLAGAMTLAG VQFSPRGAEP
GEA
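
The protein sequence corresponds to the gene sequence translated codon by codon. The following is a minimal sketch of that translation, not the tool the database used. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard code, and it does not handle alternative start codons, which is sufficient here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGAATCCGA TTCGCCAGCA CGGCCGCGTG"))  # MNPIRQHGRV
print(translate("GGGGAGGCGT GA"))                     # GEA
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared against the listed protein sequence and the reported GC content.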