Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4799 |
Symbol | |
ID | 5736644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 6116933 |
End bp | 6118150 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281965 |
Product | major facilitator transporter |
Protein accession | YP_001547558 |
Protein GI | 159901311 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.896954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCAGG TTCCTTCGAC GCATGTGCGT CATCCAATGT GGCCAATTTA TCTCTGCGTT GCGGTTACTG CCCTTGGAAT TGGGATCATC AACCCGTTGA TTCCGCATTT ACTTGAGCAA AACGGCGCTA ATGAATTTAT TGTGGGCTTG AGCACCAGTG TGATGTTTGC CAGCCTTGTG CTAACTGGCA TGCCGATTGG CCGCACAATC GATAAATTTG GCATTCGACT ATTTTTGATC GTTGGCTTAA TTTCCTACAC CGTTGCTATG CTGGCAATGC CTTGGACGCA TAGTATCCCG CTCTTTTTCA TGTGGCGGAT TTTCGAGGGG ATTGGTTGGT CGTGTGTTTG GACTGCCGCT GAAACCTATG TCAGTCGGGT TAGTTCGCCT GCACAACGAG GGCATAACAC CGCCGTCTAT GGCATGTCGC TGGCCAGTGG TACAGCAGCA GGTCCAATCA TCGGCACAGC GGTTTATGAA TGGACCAAAA ACCCTGTCCA TCCATTTTTG GTAGCAGTCG CGGCAGCGAT TATCGCGACC GTGATTGTGG TCTTGGTTGT GCCAGAACCG CATGTTCATC ATACAGAGCA GGAAAGTGGT GGTGTCAGCT TTAAAATCAC CCGCCCATTA ATTTTGCCTT TGGCGATTGC CTTCCTGTAT GGTTATGGCA CGTTGTCGCT GGTTTCATTG CTACCAACCC TCAATTATAG CAACTGGGAG CTGGGTACGC TGATTTCGGT AACGGTCATT GCCAACATTG TGGCCCAAGT GCCAATTGGC CGAATGCTTG ATCGCTATGG CTATCGGCCA TTACTAATTG GCTCGTTATC GCTCCTTTCG GCGGCAGCCT TGTTCTCGAC CCTGCACCCA CCGTTCTTGC TAACCCTATT TCTCGGGATG TTGCTGGGCG CATTCGCTGG CACGCTCTAT CCAATTGGGT TGGCTGTGTT AGCAGCGCGG GTTCCACCAG CCAAATTGGG CGGTGCTAGC GGCATGTTTA CGGTTTGCTA TGGGCTTGGC AGTTTTGTTG GGCCTGCGCT CACAGGCGGC GTAATGTCCA TGGTTGGTGC GAAACATAGC GACCAAGCCT TGTTTGGCAC GATTGGCTGT TTGGTGCTCG CACTCTTGGG CTTAATGATG ACCGGAATCG ATCGGGTCAA CGTCAAAGAG GAACATCCAG TTAGCGTAGC CTCAAGCGGC ATCAAGCCAC CAATGTAA
|
Protein sequence | MQQVPSTHVR HPMWPIYLCV AVTALGIGII NPLIPHLLEQ NGANEFIVGL STSVMFASLV LTGMPIGRTI DKFGIRLFLI VGLISYTVAM LAMPWTHSIP LFFMWRIFEG IGWSCVWTAA ETYVSRVSSP AQRGHNTAVY GMSLASGTAA GPIIGTAVYE WTKNPVHPFL VAVAAAIIAT VIVVLVVPEP HVHHTEQESG GVSFKITRPL ILPLAIAFLY GYGTLSLVSL LPTLNYSNWE LGTLISVTVI ANIVAQVPIG RMLDRYGYRP LLIGSLSLLS AAALFSTLHP PFLLTLFLGM LLGAFAGTLY PIGLAVLAAR VPPAKLGGAS GMFTVCYGLG SFVGPALTGG VMSMVGAKHS DQALFGTIGC LVLALLGLMM TGIDRVNVKE EHPVSVASSG IKPPM
|
| |