Gene Saro_0234 details


Gene Information

Locus tag: Saro_0234
Symbol:
ID: 3916222
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 242270
End bp: 243529
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 68%
IMG OID: 640442959
Product: protein of unknown function DUF894, DitE
Protein accession: YP_495516
Protein GI: 87198259
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
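
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(242270, 243529)
print(length)                  # 1260
print(protein_length(length))  # 419
```

Both values agree with the Gene Length and Protein Length fields reported for Saro_0234.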


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCTGCGA TACCCTCCCA GAACCCCTCC GGCGGCGCGC TCGCGCCGTT CCGCTACCCG 
GCGTTCCGGG CGATCTGGAC GGCCAACCTG TTCTCCAACA TCGGCTCGAT GATCCAGTCG
GTGGGCGCCG CGTGGCTGAT GACCGAGCTG ACCACGTCGC ACTTGCTGGT CGCGCTGGTT
CAGGCCTCGG CGACGATACC GATCCTGCTG CTCGGCATGT TCGCGGGCGC CATTGCCGAC
AACTACGACC GGCGCCGGGT CATGCTCGCG GCGCAGAGCG GAATGCTTGT CGTGTCCGCC
GTGCTGGCCG TGCTGTCCTA TACCGACAAC ATCGGGCCCT GGTCGCTGCT GGCGCTGACG
CTGATGGTCG GGATGGGAAC CGCGCTGAAT GGTCCCGCAT GGCAGGCGTC CGTCCGGTTG
CAGGTTGCCC ATGCCGATCT CCCGCAGGCG ATCTCGCTCA ACGCCATCTC CTTCAATCTT
GCCCGCAGTG TGGGGCCGGC GCTTGGCGGC ATCCTGATAT CGCTGTGGGA TACGAGCCTC
GCCTTCGCGC TGAATGCGGT CAGCTACATC GGCATGATCG CGGTCCTCGC CATGTGGCGG
CCCGAATCGC TGCCGCCGAT GCGCGAGCCG ATGCTGGGAG CGATCGGGCG CGGCATCCGC
TTCTGCGCGT CGTCATCGCC CATTCGCAAG GTGCTGCTGC GCGGCCTCGC CATGGGCCTT
GGCGCAGCCG GTCTCCAGGC CCTCATGCCC AGCGTCGCGC GCGACATGCT GAAGGGCAGC
GAACTGGACT ATGGCCTGAT GCTCGGCGGA TTCGGCATCG GCTCGATCGT GACGGCGCTG
TGGATTTCCA GACTGCGCCG CCGTCTGGGC AGCGAGACGG TGGTGACCGC CGCCACGCTG
ATCTTCGCGA GCGCGCAAGT GCTGATGGCA TCGGCGACGA ACATGCCGAT GGCGGTGGTC
GCCGCGTTCA TGGGCGGCAT GGGCTGGGCC AGCGCGATGA CCAGCCTCAA CGTCGCGATG
CAGCTTCGCA GTCCCGAGGA CATTCTCGGC CGCTGTCTTT CGATCTATCA GGCGGTGACC
TTCGGCGGCA TGGCGCTGGG CGCATGGGCC TGGGGCACGG TCGCGGACGT GGCGGGGCTG
CCGACCGCGC TGCACGCGGC CGCCCTCTGG CTTGCCGCGT CGCTTGCCTT GCACTTTTTT
GCCCCTATGC CGACGCGCGA GGAAGGACGC CTCGACGTTG TGCCGGAGAG CAGACCGTGA
 
Protein sequence
MPAIPSQNPS GGALAPFRYP AFRAIWTANL FSNIGSMIQS VGAAWLMTEL TTSHLLVALV 
QASATIPILL LGMFAGAIAD NYDRRRVMLA AQSGMLVVSA VLAVLSYTDN IGPWSLLALT
LMVGMGTALN GPAWQASVRL QVAHADLPQA ISLNAISFNL ARSVGPALGG ILISLWDTSL
AFALNAVSYI GMIAVLAMWR PESLPPMREP MLGAIGRGIR FCASSSPIRK VLLRGLAMGL
GAAGLQALMP SVARDMLKGS ELDYGLMLGG FGIGSIVTAL WISRLRRRLG SETVVTAATL
IFASAQVLMA SATNMPMAVV AAFMGGMGWA SAMTSLNVAM QLRSPEDILG RCLSIYQAVT
FGGMALGAWA WGTVADVAGL PTALHAAALW LAASLALHFF APMPTREEGR LDVVPESRP
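
The gene sequence starts with GTG while the protein sequence starts with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG can act as an alternative start codon and is then read as methionine. The following is a minimal sketch of that translation, not the database's own method: the function name `translate_cds` is hypothetical, and the set of start codons is a simplified subset of those that table 11 permits.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G order.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplified assumption: only the most common table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading an initial start codon as M."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCCTGCGA TACCCTCCCA GAACCCCTCC"))  # MPAIPSQNPS
```

The output matches the first ten residues of the protein sequence listed above.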