Gene Saro_2604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2604 
Symbol 
ID3917019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2815108 
End bp2816343 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content66% 
IMG OID640445363 
Productmajor facilitator transporter 
Protein accessionYP_497874 
Protein GI87200617 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGACC AGAGCAATTC AGATGCCATC CGGGCAGGCG CTATCCGGAG CGCGCCGTCG 
CCTGCCGACG TCAACGGCTG GCCCGCCGTC GCGAGTGCCA TTCTCCTTGG CACGATCGGA
GTGCTCTCGT TCATCATCCA GCCGGGGCTG GTGCAGGGCT ATGTGACACA TCTGGGCCTT
GGCGAGGCCG CGGCGGTGGA CCTTGCCGGC ATCGAGATGC TGGGCGTCGC CCTTGCCACC
ATCGCGCTGG CCATGTTCGG CGGGCGGGTG GACTGGCGGC ACGTGGTCCT TGCCGGTCTC
GGCCTTGCCG TGGTGGGCAA TGCCGGTTCA GCCGCGACGC AGGGCGCGCT CTTCGCGCTG
TTCCGGTTCG TGTCGGGCCT TGGCGAAGGC ACGATCATAT CGATCAGTTT CACGTTCGTC
GGCGTGACCC GCCGGACCGA GCGCAACGTG GCGCTCTATC TCGTGCTCCT GCTGACGTAT
GGCGCATTTA CCCTGTGGCA GTTGCCGGCC ATTCTCGACG CCATCGGCCT GCCGGGCCTG
TTTGCCGCCT TCGCCGCGCT ATCGGCCCTG GCGGTGGTGA CGGTACCGCT TGTCCCAAGG
GCCTATCACG CCCAGGAAAT GGCCAATCCC GAAGCCCGCC AGCTTTCCCG CGTCCTACTG
GCGGTCGCTC TGGCCGGGGT TCTTGCCTAC AACCTTGCCC AGGGAATCGC ATGGGCCGTT
CTGTTCCTTG TCGGCATCGG AGCCGGGCTT GGCGAGCAGC AGGTGGCCGA CAGCCTGTTC
CTGTCGCAGG TCGTGGCGAT TGCCGGCGCG CTGGCATCGG TGTTCCTCGC CGCCAGGCTG
AACCGCAACG CCGCCATCGC TTTCGGCATA CTGGTGGGCG CTGCCAGCAT TGCCCTGCTT
GAAGGCGCGC CTTCGGCGGC GTTCTTCACC GTGGGCGTGT GCGGCTTCAA CTTCCTGTGG
AACTTCGTCC TGCCCTTCAT TCTCGGCCGC ATCTGCGATT TCGATACGAG CGGGCGGATG
ATGTCGCTTG CCATCGCCAT GCAGATGACC GGGCTGGGCG GAGGCCCCCT GCTGGCGGCG
CGGCTGATCG ACGGTAACGG CTACGGTCCG GTACTGACGC TCTGCATCGG CCTGTTCATC
GCCAGCTTCC TGCTGCTGCA ATTGCCCATG CGCAGGCACG GAGCGCTTCT TGCGTCCACC
CCCGCTCCTG CGGCTGTCCT TTCAAACGCC ATCTGA
 
Protein sequence
MSDQSNSDAI RAGAIRSAPS PADVNGWPAV ASAILLGTIG VLSFIIQPGL VQGYVTHLGL 
GEAAAVDLAG IEMLGVALAT IALAMFGGRV DWRHVVLAGL GLAVVGNAGS AATQGALFAL
FRFVSGLGEG TIISISFTFV GVTRRTERNV ALYLVLLLTY GAFTLWQLPA ILDAIGLPGL
FAAFAALSAL AVVTVPLVPR AYHAQEMANP EARQLSRVLL AVALAGVLAY NLAQGIAWAV
LFLVGIGAGL GEQQVADSLF LSQVVAIAGA LASVFLAARL NRNAAIAFGI LVGAASIALL
EGAPSAAFFT VGVCGFNFLW NFVLPFILGR ICDFDTSGRM MSLAIAMQMT GLGGGPLLAA
RLIDGNGYGP VLTLCIGLFI ASFLLLQLPM RRHGALLAST PAPAAVLSNA I