Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3888 |
Symbol | |
ID | 5077372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009426 |
Strand | - |
Start bp | 56617 |
End bp | 57801 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640480995 |
Product | major facilitator transporter |
Protein accession | YP_001165657 |
Protein GI | 146275496 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0648184 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGCGA TGATCCAGGC ATTGCAGTCA CTTGGGTTTA CAACTTTGCT TCCGGCCTTG GGTGTCATGG CAAACGATCT GGGGGCAAGC GGTTCAAACC AGCGGCAATG GGTCATCGGC ACCTTCCTGA TCTGCTCCGG CCTGTTTTCG CTCGTCCCGG GGACGATCTC CGACCGGCTT GGGAGGAAGC CGGTCCTGCT GGTTTGCATG GGCCTGTTCG CCCTGATCAA TCTGCTTTGT GCTTTCGTGG CTGACTTTTC CGTCCTGCTG GCGGCCCGCG CGCTGCTTGG CTGTGCCTCG TCCGCCCTGA CCGTCCTTCC TCTGGCGATC ATTCGCGACC GCTATCAAGG CGAAGACATG GCCAAGCTGC AAGCCTTTGT CGCGATGCTG TTTATGGCAG TCCCGACCCT TGCCCCAAGT TTGGGTTATG TGATCTTTAC TACGCTCGGC TGGCGTGCCG TCTTCATTGT CATCGGGGTG CTGTCGATGG GAGTGTCAGC ATGGTATTTC TACCGCATGG AAGAAACGCT GCCGCTTTCC CGGCGGCAGT CCCATGGTGC CAGTGAATTG TTCGGCAACA TCCGCCTCGT TCTGACAAAC CGCCGGTCGA TCGGTTACGT CATTGGCATG GCCCTGATTT TCGGCGCTCA CTTCGGCTTT ATCAACAGCT CGCAGCAATT GATCGGCGAA CATTTCGGGG CCGGCGGGGC CTATTCGGTA ATCTTTGGCT TGATGGCGGG CTCGATGATG CTGGCCAGCA TTGCCAATTC TGCGATCGTA CACCGTTTCG GCATCCGGGG GATCGGCCAT GCCGCCATCC TGTGCCATTT CCTTGTGTCG ATCGGCCAGA TCTATTTGGC ATCCCGACCG GGCGAAACCC TGCTTCAGTT TGTGCTTCTG ACATCCGCGA ACATGTGCTT GCTGATCACG GTCTTCATCA ATTTCACCGC GATTTCCCTG CAGCCATTCG GCAAGGTGGC CGGGGCCGCG GCCTCTGTTC AGACGTTCTT CCGTTTGGTG CTTGGCGCGG GGCTGGGCGC ACTGATCGGG CAGGCCTACA ATGGTTCCCC GCTGCCTCTG GCCTATTCGT TCTTCGGGGT GGCGAGCGTG ACCATCGTGC TTGTACTGTT CAGCGAAGGC TGGCGATTGT TCGGGGGACC TATACCGGTT CAGGCTGCGA ACTGA
|
Protein sequence | MMAMIQALQS LGFTTLLPAL GVMANDLGAS GSNQRQWVIG TFLICSGLFS LVPGTISDRL GRKPVLLVCM GLFALINLLC AFVADFSVLL AARALLGCAS SALTVLPLAI IRDRYQGEDM AKLQAFVAML FMAVPTLAPS LGYVIFTTLG WRAVFIVIGV LSMGVSAWYF YRMEETLPLS RRQSHGASEL FGNIRLVLTN RRSIGYVIGM ALIFGAHFGF INSSQQLIGE HFGAGGAYSV IFGLMAGSMM LASIANSAIV HRFGIRGIGH AAILCHFLVS IGQIYLASRP GETLLQFVLL TSANMCLLIT VFINFTAISL QPFGKVAGAA ASVQTFFRLV LGAGLGALIG QAYNGSPLPL AYSFFGVASV TIVLVLFSEG WRLFGGPIPV QAAN
|
| |