Gene Saro_0800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0800 
Symbol 
ID3915854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp850094 
End bp851695 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content66% 
IMG OID640443531 
Productmajor facilitator transporter 
Protein accessionYP_496079 
Protein GI87198822 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACAG AACCGCGCCG CTCACTGGCC CCCGGCGCAT GGTACGCGCT CGTCCTCGTC 
GCGCTGACCA ATGCGATGAG CCTGCTCGAC CGGCAGATCC TCGCGATCCT CGCGCCTGCG
ATCAGGAAGG ACCTCCAGAT CGGCGATGCC GAGATGGGCC TGCTCTACGG CACCGTGTTC
GCGTTGTTCT ACGCGCTGTT CTCGCTGCCG GTGGGGCGGC TGGCCGACGG CTGGGTGCGG
ACGCGCCTTC TGGCGATCAG CCTGCTGTTC TGGTCGGCCG CCACCGGGCT TGCAGGGCTT
GCCTCCAGCT TCGCCATGCT TGCGCTGTCG CGCCTTGGCG TAGGCATTGG CGAGGCTGCG
ACCCAGCCCG CCGGAACCTC GCTCGTTTAT GACTTCTGGC CCAAGCACCG CCGTGGTTTC
GTCATGTCGG TCATGGCGTC CGCCATTGCG CTCGGTCTCG GCGGCTCGCT GGTGCTGGGC
GGCGTGGCGG CGGGGTGGTG GGATGCTGCT CATGCCGTCG GCACAGCGCC ATTCGGCCTC
AAGGGTTGGC AGTTCGCATT CCTCGTCGCC GCCGCCCCCG GCTTCGTGCT GGCCGCTTTC
CTGTGGACGC TCAAGGAGCC GGTGCGCGGG CAGATGGACG GGATCGAGAC CCCGCCCGAA
CCGCACCCGT TCGCCAGAAG CCTGTCGCTC CTCGGCTCGG TCACTCCCGG CTTCCACTGG
ATCGGCATGA AGGGCAGGGG CGCCAGCCCG GCGATGGTGC GCGGCAATCT GATCGCGCTC
GTCCTCATCG CCCTCGCGGC CTATGGGCTC ACGCAATTCA GTACCGCGAT CAGCCCCAAG
CCCGCGATGG TCCTCGGCAG TGTCGCGATC AACCCGCATG CTTTGCAGTG GTGCGTGATC
GGCCTGGGCG TATTCGTGAT CGTCAACCTG ATGCAGGGCA TGAAGCTGGG CGATGCGCAG
GCCTTCCGCG TCATCTCGCG CTCGCCAACG GTAATGATGT GCATTGCCGT CGGCACGCTG
CAATCGACGC TCAACTACGG GATGATGGCT TTCAACCCCA GCTTCCTGAT CCGCAGCTAC
GGCCTCTCGA TGCAGGAGAC CGCCTTGCAG TTCGGTCTGC TGTCGGCAGG CATGGGCATC
GTCGGTCCGC TGGTCTGGGG CCCGCTCTCT GACTGGCTGC AACAGCGCTT CCCCGGCTCG
GGCCGCGCCT GGGTCGCGCT CTTCGCGATG GCAGTGTCGC CGGTCCTCTC GTTCTGGGTC
TATTACGCCG CCGATCCCGG CAGCTTCTAT GCCCGCTTCC TCGTCTACAG CTTCATCCTC
ACCGGCTGGA TGCCGCCGCT CTACGCGATC ATGTACGATC AGGTGCTGCC GCGCATGCGG
GGGCTAACCG CCAGTCTCTA TCTTCTGGTG ATGACCATCC TCGGCATGGG CATCGGCCCC
TACGTCGTCG GCCTGCTTTC CGACGCTACC GGTTCCCTGC GCACCGCCAT GCTGTCGATC
AACACCGTGG CGATTCCCAT CGCCGTCCTG ATGGTCCTGA TTGCCCGCCG CGCCGAACGG
GACGAGGCCG GGTTGCTGGA ACGGGCGGGG ACATCGGTTT GA
 
Protein sequence
MTTEPRRSLA PGAWYALVLV ALTNAMSLLD RQILAILAPA IRKDLQIGDA EMGLLYGTVF 
ALFYALFSLP VGRLADGWVR TRLLAISLLF WSAATGLAGL ASSFAMLALS RLGVGIGEAA
TQPAGTSLVY DFWPKHRRGF VMSVMASAIA LGLGGSLVLG GVAAGWWDAA HAVGTAPFGL
KGWQFAFLVA AAPGFVLAAF LWTLKEPVRG QMDGIETPPE PHPFARSLSL LGSVTPGFHW
IGMKGRGASP AMVRGNLIAL VLIALAAYGL TQFSTAISPK PAMVLGSVAI NPHALQWCVI
GLGVFVIVNL MQGMKLGDAQ AFRVISRSPT VMMCIAVGTL QSTLNYGMMA FNPSFLIRSY
GLSMQETALQ FGLLSAGMGI VGPLVWGPLS DWLQQRFPGS GRAWVALFAM AVSPVLSFWV
YYAADPGSFY ARFLVYSFIL TGWMPPLYAI MYDQVLPRMR GLTASLYLLV MTILGMGIGP
YVVGLLSDAT GSLRTAMLSI NTVAIPIAVL MVLIARRAER DEAGLLERAG TSV