Gene Saro_1953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1953 
Symbol 
ID3917268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2071344 
End bp2072978 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content63% 
IMG OID640444700 
Productgeneral substrate transporter 
Protein accessionYP_497227 
Protein GI87199970 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGCGTGA ATGCATCCGT AACGAAGAAC GACGCCGCGA AGCCGAGCGC GTCGGATATC 
AGGCTCGTGA TCGCCGCCAG TTCGGCGGGT ACCGTGTTCG AATGGTACGA TTTCTTCATC
TACGGCACGC TGGCATCAAT CATCGGCAGG ACGTTCTTTC CCTCGGACAA CGCAACGCTC
CAGGTGCTGC TGGTGTGGGC GGGATTTGCC GTCGGCTTCG GCTTCCGTCC GCTCGGCGCA
GTCTTGTTCG GCTATCTCGG CGACAAGCTC GGCCGCAAGT ATACCTTTCT CGTCACGGTC
ACGCTCATGG GTGTCGCCAC GGCGGGCGTC GGCCTGATCC CCTCTGCCGC GACCATCGGC
CTTGCCGCAC CGGCCATCGT CATCCTGCTG CGCGTGCTGC AAGGGCTGGC ACTGGGTGGG
GAGTACGGCG GCGCTGCGAT CTATGTTGCA GAGCATGCAC CGGGCGGCCG TCGCGGCTAT
TACACCAGCT ACATCCAGGC CAGCGTCGTG GGCGGCTTCG TGCTGAGTCT GATCGTAGTC
CTGTCCAGCA AGGCGCTGAT GAGCGATGCC GTGTGGAACG ACTGGGGTTG GCGCGTGCCG
TTCCTGGTCA GCCTCGCGCT TCTCGCGATT TCGTTGTGGA TGCGCATGAA GCTGTCGGAA
AGTCCGGTGT TCCAGGCGAT GAAGGAGGAG GGCGAGCTTG CCGGTAATCC CTTCGTCGAA
AGCTTCACCT ACCCTGGCAA CAAGCGCCGC ATTTTCATCG CGCTGTTCGG CATCGCCGCC
GGGCTAACCG TGATCTGGTA CACGGCGATG TTCTCCGGCC TCAGCTTCCT CAAGTCTGCA
ATGCGCATGG AGGATACTCT GGCCGAGGTC GTCGTCGGTA TCGGCGCCAC ACTCGGAATG
GGCTTCTTCA TCTACTTCGG CTCTCTTTCC GACCGTATCG GCCGTAAGAA GCCCATCATC
ATCGGCTATG CCGTCACGCT GCTAATGCTC TTTCCCACGT TCTGGCTGAT GGGGGCCGCC
GCCAATCCGC AACTCGCCGA GGCGGCAGAG CGCAACCCGG TCGTCGTAGC CGGGCCTGAC
TGCAACTACA GCCCCTTTGC CTCTGAACAA GTCAGCAATT GCGGCAAGCT CCTGGCTGAC
CTGGCGGCGT CCGGTGTGTC TTATAGTCTG CGCGATGACG CTGTGTTTGG CATGACCGCA
GGTGGTTCAG CGGTTGATCT TGCCAGCTAT CCGTGGACGG ACAAGGCTGC TGCGCGCGCC
AAGGCGCTCC AGTCCGAGCT TTCCGCGCAT GGCTACGATT TCGCCAAGGT CCAGCCCTCG
CTCGGCCGGA TTGTCGCGGT TATCGGTGCG CTGCTGGCGC TCATGGCGAT GTCCGGTGCG
ACCTACGGGC CGGTGGCCGC CCTTCTTTCC GAGATGTTCC CGCCGCGCAT CCGATACAGT
TCGATGTCGA TCCCGTATCA TCTCGGCACG GGCTACTTCG GGGGTTTCCT GCCGCTGATT
TCCAGCTACA TCGTCGCGCG CACCGGCGAT CCCTATGCCG GGCTATGGTA CACTTGGGTG
GTCGTCCTGG TCGCGCTCCT CGTTGCGGCG TGGGGTTTGC GGCCAGGCCT GCCCGCCGAC
TTCACGGATG ACTGA
 
Protein sequence
MCVNASVTKN DAAKPSASDI RLVIAASSAG TVFEWYDFFI YGTLASIIGR TFFPSDNATL 
QVLLVWAGFA VGFGFRPLGA VLFGYLGDKL GRKYTFLVTV TLMGVATAGV GLIPSAATIG
LAAPAIVILL RVLQGLALGG EYGGAAIYVA EHAPGGRRGY YTSYIQASVV GGFVLSLIVV
LSSKALMSDA VWNDWGWRVP FLVSLALLAI SLWMRMKLSE SPVFQAMKEE GELAGNPFVE
SFTYPGNKRR IFIALFGIAA GLTVIWYTAM FSGLSFLKSA MRMEDTLAEV VVGIGATLGM
GFFIYFGSLS DRIGRKKPII IGYAVTLLML FPTFWLMGAA ANPQLAEAAE RNPVVVAGPD
CNYSPFASEQ VSNCGKLLAD LAASGVSYSL RDDAVFGMTA GGSAVDLASY PWTDKAAARA
KALQSELSAH GYDFAKVQPS LGRIVAVIGA LLALMAMSGA TYGPVAALLS EMFPPRIRYS
SMSIPYHLGT GYFGGFLPLI SSYIVARTGD PYAGLWYTWV VVLVALLVAA WGLRPGLPAD
FTDD