Gene Information |
Locus tag | Saro_3440 |
Symbol | |
ID | 5077589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | - |
Start bp | 41794 |
End bp | 43002 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640481164 |
Product | major facilitator transporter |
Protein accession | YP_001165826 |
Protein GI | 146275666 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCAGGC AACACGAACC GGGCGTGATG ACCGGCATAG CGCTGCTCCT GCCGATCACG CTGACCACGA TGGCGATCGT GCTGCTCGCG CCAGTCCTGC CGCAACTCAT GGCCGAATTC GCCGATGTGC CGGGGCATGA ATACCTGGTG CCGATGGTGC TGACGCTGCC TTCGCTGTGC GTCGCCGTAC TGTGCCCTTT CGCGGGAATG CTGGGCGACT ACTTCGGGCG GCGCCAGCTG CTGCTGGCTT CGTTCGTGCT TTATGCCGTG GTCGGTGTAG CGCCGCTGTT CCTCACGGAC CTGGTCCACA TCCTGATTTC CCGCATCGGC GTGGGCGTGG CCGAAGCGAT GATCTACGTC CTTTCGACGA CGATGATCGG CGATTACTAC GAGGGCGAAC GGCGCGACCG CTGGCTGGCG GGGCAGACGG CTTTCGCCTC GGTCTCGGCG CTGGTGTTCT TCAACATTGG CGGTCTGCTG GGCGAGGCGG GTTGGCGAAC GCCCTTCATT GTCTATGCCT CGGCGCTGGT GATGTTTGCC CTGGTTCTGA AATTCACGTG GGAACCGTCT GGAGACAAAG CTGAAATCGC AGATGCCACG AAGGCGTCCG CCGCCCCTCA CAACCTGAGC TGGGCGATGT TTCCCTGGGG AAACCTTGCG CTGATCGCGG GCATAACCGT CTATGGGGCG ATCTTCTTCT ACACCGTGCA GATCCAGGCA TCGGCGGGCC TTGCCGAGCT GGGACTGTCC AGCGCCGCGC GCATCGGGTT CCTTACCTCG CTGGCCAGCC TTGGCGTTCC CTTGGGCACG TTCATCTATT CGCGGCTTGG CCGGATCGGT GTTGGAAAGC TCCTTCTCGC CGAATTCGGC ATTCTTTCCA TAGGTTTCCT GTTGATGGGG CGGACCGGAT CGGTCCCGGG ATTCCTTGCC GGATGCTTCA TCAACCAACT GGGCGCGGGG ATGCTGCTGC CGACGCTGCT CGTCTGGTCG ATGAGCATCC TGCCCTTCGA AGTGCGCGGG CGCGGGACCG GTTTCTGGCA ATCGTCGTTC GCGCTCGGCC AGTGGTTGAG CCCGCTGGCG GTCACTTTCT TCGCGCTGCA CCTGGGGGGT CTGATGAAAA GCTTCGAGAT GCTCGGCTAC ATGGCTGCCG CTGGCTTCGT CGTCGCGCTC GTGGCGGGTC GCAAGACCGC GGTGGCGGCC AATGCCTGA
|
Protein sequence | MGRQHEPGVM TGIALLLPIT LTTMAIVLLA PVLPQLMAEF ADVPGHEYLV PMVLTLPSLC VAVLCPFAGM LGDYFGRRQL LLASFVLYAV VGVAPLFLTD LVHILISRIG VGVAEAMIYV LSTTMIGDYY EGERRDRWLA GQTAFASVSA LVFFNIGGLL GEAGWRTPFI VYASALVMFA LVLKFTWEPS GDKAEIADAT KASAAPHNLS WAMFPWGNLA LIAGITVYGA IFFYTVQIQA SAGLAELGLS SAARIGFLTS LASLGVPLGT FIYSRLGRIG VGKLLLAEFG ILSIGFLLMG RTGSVPGFLA GCFINQLGAG MLLPTLLVWS MSILPFEVRG RGTGFWQSSF ALGQWLSPLA VTFFALHLGG LMKSFEMLGY MAAAGFVVAL VAGRKTAVAA NA
|
| |
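The fields in this record are related by simple arithmetic, so they can be cross-checked. The following is a minimal sketch, not part of the source database: the function names (`span_length`, `expected_protein_length`, `gc_percent`, `translate`) are hypothetical helpers introduced here for illustration. It assumes inclusive 1-based coordinates, a single terminal stop codon, and that the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA even though the strand is "-"). The codon table is the NCBI layout for translation table 11, whose amino acid assignments match the standard code.

```python
# Values copied from the record above.
START_BP, END_BP = 41794, 43002

# NCBI codon layout for translation table 11 (bases in T, C, A, G order).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the 10-base spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def span_length(start: int, end: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    return gene_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (GTG, TTG) are not remapped to M here; the gene
    in this record starts with ATG, so that case does not arise.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    gene_length = span_length(START_BP, END_BP)
    print(gene_length)                           # 1209
    print(expected_protein_length(gene_length))  # 402
    # First 30 bases of the gene sequence, copied from the record.
    print(translate("ATGGGCAGGC AACACGAACC GGGCGTGATG"))  # MGRQHEPGVM
```

Passing the full "Gene sequence" field to `translate` and `gc_percent` would allow the same comparison against the "Protein sequence" and "GC content" fields.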