Gene Saro_1535 details


Gene Information

Locus tag: Saro_1535
Symbol:
ID: 3917210
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1580914
End bp: 1582263
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 61%
IMG OID: 640444276
Product: major facilitator transporter
Protein accession: YP_496810
Protein GI: 87199553
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
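
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def feature_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_length = feature_length_bp(1580914, 1582263)
protein_length = expected_protein_length_aa(gene_length)
print(gene_length, protein_length)  # 1350 449
```

Both results agree with the Gene Length and Protein Length fields listed in the record.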


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGGCG ATACCATGAT TGCTCCCACT GAAGTCCCGC CTTCGCGGAT GCCTTATGGT 
GCCCTCTGCC TGTTGATGGT GGTCTATGCG TTCAACATGC TGGATCGGCA GATAGTGACC
ATCCTCGTAG AGGCGATAAA GGCTGATCTT GGTCTTGCAG ACTGGCAGAT CGGGATTATC
AGCGGCCTCG CCTTTGCGAT CTTTTACACC TTGCTTGGCA TACCGCTCGC ACGGATCGCG
GACCGCGGAA ACCGGGTTGG AATGATCGCG GTCTCTTTGA CTGTCTGGAG CGGCTTCACT
GCGCTGTGCG GGTTCTCGCG CAATTTTGTC GAGTTGCTTG TCGCACGCGT CGGTGTCGGC
GTTGGCGAAG CAGGATGCAC TCCTGCTGCG CACAGCCTCA TCACGGATTA TGTCGCACGC
GCGCAACGCG GCCGAGCACT CGCCCTTTAT TCGTTGGGCG TGCCGATCGG CTCGCTCGCT
GGCCTTGTTC TCGGCGGTAT ATTGCTGGCC ACTCTGGGTT GGCGATCCGC TTTCGTGATT
GCCGGTCTGC CCGGGATTAT CCTGGCGGTG ATCGTCTGGT ATGCACTTGA AGAGCCGCGC
AAACATTTGG TCGCGGCCCG GGAAACGGGG CCTGCGCACA TACCGCTGGC ACAAGCTCTC
GCTACTTTGC GTCGATTGCC AAGCTTTTGG TTGGTCTCGC TTGGCACCGC AATGGCTGCC
TTCGGCTACT ACGGCCAGTC GTCGTTTTTC GCTTCGCTCT ACCTGCGCAC GCATGGAGCA
GGGATCGATG AAATGGCCAT GGGTCACGAT ATGTCGCCAA CTGTGTTCCT GGGGCTGAGC
CTGGGATTCA TCGTCGGAAT CGTGGGTATG GCCGGCACCT TTGTCGGCGG ACTTCTGGCG
GATCTGGCCG CGCGGGGCGG GGTCAAGGGC TATACGGTAG TTCCGGCGAC TTCGCTCATC
CTGGCGGCAC CGCTATTTGC GGCTGCGGCG CTTGCCAATA CGGTCGCACT GTCCTTCGCG
TTTCTCAGCT GTGCAATCTT CGTCCACGCG CTCAACTACG GTTCGGTCTT TGCTTCTGTG
CAGACCCTTG TTCCCGCAAG GTTGCGCGCG ATGGCGGCCG CGCTGCAGTT GTTCGTGACA
AACGCGATTG GCCTCGCGCT TGGCCCGCTG TTCGTGGGCC TTGCTAGTGA CCTGCTGGGC
TCCATGCTCG GCAAGGAGCA GGGCTTGCGT GTCGCGATGG CGCTAGTCGT GCTACCGCTG
GCGGTCGGGG CCGCATTTTT CTGGGCTGCG AGCCGCCGGA TCGATGCCGA CGAGGCTCTA
GGTGCGCGTC TGGGGGCCGC CCTCCCTTAA
 
Protein sequence
MTGDTMIAPT EVPPSRMPYG ALCLLMVVYA FNMLDRQIVT ILVEAIKADL GLADWQIGII 
SGLAFAIFYT LLGIPLARIA DRGNRVGMIA VSLTVWSGFT ALCGFSRNFV ELLVARVGVG
VGEAGCTPAA HSLITDYVAR AQRGRALALY SLGVPIGSLA GLVLGGILLA TLGWRSAFVI
AGLPGIILAV IVWYALEEPR KHLVAARETG PAHIPLAQAL ATLRRLPSFW LVSLGTAMAA
FGYYGQSSFF ASLYLRTHGA GIDEMAMGHD MSPTVFLGLS LGFIVGIVGM AGTFVGGLLA
DLAARGGVKG YTVVPATSLI LAAPLFAAAA LANTVALSFA FLSCAIFVHA LNYGSVFASV
QTLVPARLRA MAAALQLFVT NAIGLALGPL FVGLASDLLG SMLGKEQGLR VAMALVVLPL
AVGAAFFWAA SRRIDADEAL GARLGAALP
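
The gene and protein sequences above are related by codon-wise translation. The following is a minimal sketch of that step, not the database's own tooling: the `translate` function and the constant names are hypothetical. It builds the standard codon-to-amino-acid mapping, which translation table 11 shares for amino acid assignments (table 11 differs in which codons may act as start codons), strips the spacing used in the sequence listing, and stops at the first stop codon.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (spaces, newlines) is removed first, so sequence blocks can be
    pasted as listed. Ambiguous bases such as N raise KeyError in this sketch.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGACCGGCG ATACCATGAT TGCTCCCACT"))  # MTGDTMIAPT
```

Passing the full gene sequence to `translate` should, under these assumptions, reproduce the protein sequence listed above once its spacing is removed.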