Gene Information |
Locus tag | Haur_4661 |
Symbol | |
ID | 5736508 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5958232 |
End bp | 5959596 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281825 |
Product | major facilitator transporter |
Protein accession | YP_001547420 |
Protein GI | 159901173 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
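The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp / End bp rows of this record.
length = gene_length_bp(5958232, 5959596)
print(length, protein_length_aa(length))  # 1365 454
```

Both results match the Gene Length and Protein Length rows of this record.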
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.731341 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGAAG CAACCAGAAA GCAACCGAGC GGCATGCTCG CATTTAGCAT TATGTGGTTT GGCCAAGTCG TTTCATTGCT TGGCAGCTCC ATGAGCAGCT TTGCCCTGAC GATTTGGGCT TGGCAAATTA CAGGTCAAGC CACAGCCTTG GCGCTCGTAG GCTTTTTCTC GTTTGCCCCA AGCATTATTG TTAGCCCCTT TGCCGGAGCC TTGGTCGATC GCTGGAATCG TAAGCTGGTG CTGATTTTGA GCGATTTAGC CACAGGCTTA TCGACGATCG CTATTTTATT GCTCTACCAC AACGATGTAC TGCAAATTTG GCATTTGTAT GTGGCTGGAG CATTTGCCAG CATTTTTCAA TCGTTTCAAT GGCCAGCCTA TTCGGCGGCA GTTTCGACGA TGTTGCCCAA ACAGCACTAT GCTCGCGCCA GCGGCATGAT GTCGATGGCC GAATCGGCAG CGGGAATTGT CGCGCCAGCC CTAGCCGGCT TTTTGCTAAC CGTAATGGGC ATTGGCGGTA TCTTGATTAT TGATATTGTG ACGTTTGTGT TTGCCGTCAG TGCAGTGCTC TTTGTTAATA TTCCCCAACC AACGCAGAGC GAGGCGGGGG CGCAAGGCAA GGGCAGTTTA TGGAGCGAGG CAGGCTTTGG CTTTCGCTAT ATTTTGGCAC GCCCCAGCCT CTTGGGCTTG CAACTGACCT TTTTTATGAT TAACTTCGTT GGTTCGTTTG AAGCCACCAT GACCGCTCCC ATGATTCTGG CCCGCACCGA TAGTAACTCG GCAATTATGG GCACGGTACA ATCGGCAATG GGCATTGGCG GAGTGATCGG CGGCTTGATC CTGAGTGTCT GGGGCGGCCC CAAACGCAAA GTTCATGGCG TGCTCGGTGG AATGGCGCTC TCCAGCTTTT TTGGCGGCAT TCTCATGGGC TTGGGCCAAA ATACGCTGGT TTGGTCGATT GCGGGCTTTG GTTTGCTATT CGTGCTACCA ATGTTGAATG GCTCGAATCA AGCGATTTGG CAAGCCAAAG TGCCACCCGA TATCCAAGGG CGGGTGTTCG CAGTGCGACG CATGATCGCC CAAATTTCGG GGCCAATCGC AATTTTGATC GTTGGACCAT TGGCCGACAA AGTGTTTGAG CCACGTATGG CGGTTGGCGG CGCTTGGGTC GATATGTTTG ACAGTTGGGT TGGCAGTGGC AAAGGGGCGG GCATCGCCTT AATTATGGTC TTGAGTGGCA TTGTTGGCAT CGCCGTAGCC GTGATCGCTT ATGGCGTGCG AGTCGTGCGC CACGCCGAGG ATCTAATTCC TGACCATCAA GATAGCCCAA GTAGCAGCCC TGAACTGCAA GCCGAACCAG CCTAA
Protein sequence | MAEATRKQPS GMLAFSIMWF GQVVSLLGSS MSSFALTIWA WQITGQATAL ALVGFFSFAP SIIVSPFAGA LVDRWNRKLV LILSDLATGL STIAILLLYH NDVLQIWHLY VAGAFASIFQ SFQWPAYSAA VSTMLPKQHY ARASGMMSMA ESAAGIVAPA LAGFLLTVMG IGGILIIDIV TFVFAVSAVL FVNIPQPTQS EAGAQGKGSL WSEAGFGFRY ILARPSLLGL QLTFFMINFV GSFEATMTAP MILARTDSNS AIMGTVQSAM GIGGVIGGLI LSVWGGPKRK VHGVLGGMAL SSFFGGILMG LGQNTLVWSI AGFGLLFVLP MLNGSNQAIW QAKVPPDIQG RVFAVRRMIA QISGPIAILI VGPLADKVFE PRMAVGGAWV DMFDSWVGSG KGAGIALIMV LSGIVGIAVA VIAYGVRVVR HAEDLIPDHQ DSPSSSPELQ AEPA
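The relationship between the two sequences can be reproduced with a short translation routine. This is a minimal sketch, not the database's own method: `translate` and `CODON_TABLE` are hypothetical names, and it assumes the listed gene sequence is already in coding orientation (which the ATG start and TAA stop suggest, even though the gene lies on the minus strand). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so the standard mapping is used here.

```python
# Standard codon-to-amino-acid mapping, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base spacing used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGCTGAAG CAACCAGAAA GCAACCGAGC"))  # MAEATRKQPS
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to the same function would be expected to yield the full 454-residue product.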