Gene Saro_0160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0160 
Symbol 
ID3918691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp156827 
End bp158104 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content66% 
IMG OID640442885 
Productgeneral substrate transporter 
Protein accessionYP_495443 
Protein GI87198186 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAACCGA CATCATCCGA GCCGCGCCAC TTGCCCACGA AGCGGCAATT CGCGGCAGTC 
ATTTCCGGCA ATGCACTGGA ATTCTATGAT TTTCTGAGCT TCAGCTTCTT CGCGGTAAAT
CTGTCGCGGG TCATGTTTCC GGCAGGGGCG CCGGGCAGTG CGCTGCTGCT GACGCTCATG
ACCGCGGGCG CGGGCTTCGG CGCGCGGCCC TTGGGGGCGA TCCTGTTCGG AGTGCTGGGC
GACCGGATCG GACGCCGCCC GACCATGCTG GCGACTTTCG CGCTGATGGG GGTGAGCGCG
CTCGGGCTCG CCCTGACGCC GGATTACGCG GCGATCGGAC CGGCAGCGCC GATCCTCGTC
GTTCTGTTCC GTCTTCTGCA GGGCCTTGCC GCCGGTGGCG ACGTGGGGCC GACGACAGCG
TTCCTCGCCG AAAGCGCGCC GTCCGAAAGA CGCGGGATGC TGATCGCGCT GCAGCTTGTC
GCGATGCGCA TGGGAGTTCT GGCGAGCGGA CTGGTGGGGC TGGTCCTGGC GAGTGTCCTG
ACGCCAGCGC AGCTCGACAG TTTCGGCTGG CGCATCGCCT TTGCCATCGG TGCGGGTATC
GTGCCGTTGG CTTTCATCCT GCGCCGCAGG CTCGACGAGA CGCTTCATAT GCCGGAAACC
GGTCCTGACG TGGTGACCGA ATTGGCCCCG CGCGCCTACG CGGCGGCGCT CTTGGGGGTA
TGCGGCTTCC TGTTGGCAGG TGCGGCGGGG GATTTCCTGT TCATCTACGC AGTGTCGTTC
CTGAAGATCG CGGTGACCAA CGGTTACATC GTGCAGATGG CGGCGGCGGG AACGCAGATC
GTCGGGCTGG TGCTTGGTGG ATGGCTCGGG GACCGCATCG GGCGACGGCG GGTGAATCTC
GTCACAGCAA TACTGGCGGC GCTGACTTCG TTGCCGCTGT TCCGCTGGGG CATCGAGGGA
AGCGCACCGG CTCGGTTCGG CGTGGCGGCG GCCCTTCTCC TGCTCATGGC GACGGTTTCG
GCTGCGGTTG CCTATGCCGC GTTCGTCGAG ACGACGCCCA AGCGGCATCG TGCGGGGCTT
GTCGGCATCG GTTATGGCGT GATGGTCGCG CTGACCTTTG GCCTGACACC AGTGGTATTG
ACCCGGTACA TGACCGCGAC CGGCGACCTC GCGGCACCCG GCTATGCCTT CGTGGTCGCC
GCGCTTTTGC TGGTGGCCTC TGCGCTGCTT CTGCCCGAGC GGAGACCCCG GCACATCGGG
AAGGTTCGTT TCACCTGA
 
Protein sequence
MEPTSSEPRH LPTKRQFAAV ISGNALEFYD FLSFSFFAVN LSRVMFPAGA PGSALLLTLM 
TAGAGFGARP LGAILFGVLG DRIGRRPTML ATFALMGVSA LGLALTPDYA AIGPAAPILV
VLFRLLQGLA AGGDVGPTTA FLAESAPSER RGMLIALQLV AMRMGVLASG LVGLVLASVL
TPAQLDSFGW RIAFAIGAGI VPLAFILRRR LDETLHMPET GPDVVTELAP RAYAAALLGV
CGFLLAGAAG DFLFIYAVSF LKIAVTNGYI VQMAAAGTQI VGLVLGGWLG DRIGRRRVNL
VTAILAALTS LPLFRWGIEG SAPARFGVAA ALLLLMATVS AAVAYAAFVE TTPKRHRAGL
VGIGYGVMVA LTFGLTPVVL TRYMTATGDL AAPGYAFVVA ALLLVASALL LPERRPRHIG
KVRFT