Gene Saro_3739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3739 
Symbol 
ID5077887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp375795 
End bp377006 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content66% 
IMG OID640481462 
Productmajor facilitator transporter 
Protein accessionYP_001166124 
Protein GI146275964 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAGGTGC ACGGCCGCCT TGGCGAAAAC GGAACCGACC GGAATGCCGG TGCACTCGTC 
GCGGCGGTAT GCGTCGCGGG ACCGGCGCTG GTGGCGCTCG TGCCGATGGC GGCAGCACCC
GCGCTCGTCG CGATGGCGCG GCATTTCGGG CAGAGCGCGG ACGACGAACT GTTCTCGCAG
ATCGTGATGA CGCTTCCGGC GGCGATGTTG ATCCTCGCCG CGCCGATGGC AGGCATGCTC
GCCAACCGTA TTGGCCAGCG AACGGTGCTG CTCGCATCGC TGGTTCTTTA CGTGATCGGC
GGAGCCGGGG TCCTTCTGGT CGCGACCCAG ACAGGGCTTC TGGCACTGCG GCTGCTCCTC
GGCGTTGCCG GCGGGGGGCT TCTTACCTCG AGCCTTGGGC TGATCGGCGA CCATTTCTCC
GGTCACAGGC GCGAGAAGGT GCTGGGTTGG GCAACTTCGT TCTCGTCGCT TCTTGCTGCT
TTGGCTCTGG TTGGCGGAGG ATGGCTGGTC GATCGTGGCG GATGGCAGGC GCCTTTCGCG
CTGTATCTGC TGGGCATTCC GACGCTTGCT GTCGCCACCT TCGCGATCCG CAACACCCCG
CCGCACGAGG AGCCTTCCGC CGAGCGAGGC ACGCTGGCCG GGGTGGCGCG CGTCTGGCCC
TACTATGCGC TTCTCATCCT CCTGACGCTG GGCATGTTCA CTCCGGCCAT CCAGGCGGGA
TTCCTGCTAT CGTCGCGCGG GTTCGGCAGC GCCCAGACCA TCGGTACCGT CATCGCCGCC
ACTTCGGTGG TCGCAATGGT CACGGCGTGG GCCTTCGGGC CCCTCCGACG ACGCATCGGG
TTGCACGGGT TCCTTGCCAT CGACGCCGCC TCGATGGGCC TGGGCATACT CGTCATCGCC
GTGGCGGGTT CTACGTGGCA GGTCCTTGCA GGCTGCTGTC TCGTCGGCAT CGGGGCGGGC
ATGTCGGAGC CGGCCACCGC GTCGATCATC TTCCGCAAGG CCCCGCCCCA CGTCCACGCC
CTTGCCATGG GCCTTATCGT AAGCGCGCTC AACGCGGGCC AGTTTCTCAA TCCGCTGGCC
TTTGACGTGC TCCGCCGGAG CGGCGGGCTG ACCGGGGCAT TCGTCACATT CGGCATGCTT
CTTCTGGGAG CAGCCATGCT CGTCGCAGTG CTGCGACGGG ACGATCTTCT TGAAAGGACC
ATCAAGCCAT GA
 
Protein sequence
MKVHGRLGEN GTDRNAGALV AAVCVAGPAL VALVPMAAAP ALVAMARHFG QSADDELFSQ 
IVMTLPAAML ILAAPMAGML ANRIGQRTVL LASLVLYVIG GAGVLLVATQ TGLLALRLLL
GVAGGGLLTS SLGLIGDHFS GHRREKVLGW ATSFSSLLAA LALVGGGWLV DRGGWQAPFA
LYLLGIPTLA VATFAIRNTP PHEEPSAERG TLAGVARVWP YYALLILLTL GMFTPAIQAG
FLLSSRGFGS AQTIGTVIAA TSVVAMVTAW AFGPLRRRIG LHGFLAIDAA SMGLGILVIA
VAGSTWQVLA GCCLVGIGAG MSEPATASII FRKAPPHVHA LAMGLIVSAL NAGQFLNPLA
FDVLRRSGGL TGAFVTFGML LLGAAMLVAV LRRDDLLERT IKP