Gene Saro_3636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3636 
Symbol 
ID5077784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp262262 
End bp263560 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content69% 
IMG OID640481359 
Productmajor facilitator transporter 
Protein accessionYP_001166021 
Protein GI146275861 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCACAC CTGCATCCAC CGCCGCCTTC GCGACGATGA CCGCCGGCGG CAAGCCGCTC 
ACCAACCGCT GGCTGGCCTT GGCGCTGCTG GTTCTCGTGG CGGTACTCAA CTACGCCGAC
CGCTTCCTGA TCTCGGGCCT GGCCGAACCC ATCAAGGCGC ACTTCGGCAT CGGCGACGCG
ATGATGGGCC TGCTTATGGG CCCTGCCTTC GCACTGCTCT ATGCCGTATT CACCCTGCCG
ATCGCGCGCC TTGCCGACCG CCGCTCGCGC ATCCTGATCA TTGCGGCCGG ATGCGGCGTG
TGGAGCTTCT TCACGATGCT GTCAGGCATG GCCGCCAGCG CCAACATGCT GGCGCTGGCC
CGGGTCGGCG TCGGCATCGG CGAGGCTGCA TACCAGGCGC CTGCCGCGGC ATTGATCGCC
GCCTATTTCC CGCCGCACGA ACGGGGCCGC GCCTTCGCAC TGCTGGGCAC GGCCATCTAC
GTCGGCCAGA TGACCGGCCT TGCCGGTGGC CCCGCAATCG CCGCGACCAG CGACTGGCAG
ACGGCCTTCC ACGCGCTGGG CATCGCCGGG ATGGTCGTGG CGGCGGCAAG CTTCCTCGTC
ATCCGCGAAC CCGCGCGCGA GGCGGCCGAC AAGGCGGCGC CCGTCCTGCC GATGGGAACG
ACGCTGCGGC TGCTCATCTC GACACCCTCG GTACGCTTCC TTGCCACCAT CATGGCGCTC
GGCTCGCTTT CGGGCGTGAC CTTCGGGATG TGGGGCCCTG CCCTGTTCGA GCGCTCGTAC
GGCCTGACCA CGCAAGAGGC GGGGACGACG TTCGCGCTGA CTTTCGGCCT GCCGGGATTG
CTCGGCGTGC TGGGCTTCGG CTTTCTGGCC GACCGTCTTG GCAAGAACGA TCCGACCATG
CAGCTTCGGC TTACGGCGTT CGCGCTGGGC GGGGCTACGA CGGCGATCCT TGCCGTTACC
TGGACCGACA GCCTCCTGCT CGCCCGCCTG TTCGCCGTGC CGGCAGGACT GCTGGGCGGG
GGATGGTCGG TCGGCGTTCT GGCGGGCCTG CAATATCTGC TGCCCAATGC CCATCGCGCC
ACGGGCACCG CGCTGGTCCT GCTGATCGCC AGCATGTTCG CAACCGTTCT CGGCCCGGTC
CTTGCCGGAC AGTTGAGCGA CTGGATCGCG GGCGCCGGCC CCCACGGGTT GCGCATCGGC
CTCAGCGTCG CGATCCCGAC CGGATACGTC GGCGTCTGGG CCGCGTTCCG CACCGTTCAC
GCGCTGAACC GCGACCGCGA GGCCCTGGCG CAAGCCTGA
 
Protein sequence
MATPASTAAF ATMTAGGKPL TNRWLALALL VLVAVLNYAD RFLISGLAEP IKAHFGIGDA 
MMGLLMGPAF ALLYAVFTLP IARLADRRSR ILIIAAGCGV WSFFTMLSGM AASANMLALA
RVGVGIGEAA YQAPAAALIA AYFPPHERGR AFALLGTAIY VGQMTGLAGG PAIAATSDWQ
TAFHALGIAG MVVAAASFLV IREPAREAAD KAAPVLPMGT TLRLLISTPS VRFLATIMAL
GSLSGVTFGM WGPALFERSY GLTTQEAGTT FALTFGLPGL LGVLGFGFLA DRLGKNDPTM
QLRLTAFALG GATTAILAVT WTDSLLLARL FAVPAGLLGG GWSVGVLAGL QYLLPNAHRA
TGTALVLLIA SMFATVLGPV LAGQLSDWIA GAGPHGLRIG LSVAIPTGYV GVWAAFRTVH
ALNRDREALA QA