Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2001 |
Symbol | |
ID | 5733890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2480071 |
End bp | 2481348 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279145 |
Product | major facilitator transporter |
Protein accession | YP_001544772 |
Protein GI | 159898525 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCTTG ATACGCTCAA ACCCACAACG ACCAACCCAA TCGATAACTG GAGCCACATT GTTTGGAAAC CACGCTTTTT TGCCATTTGG CTGGGTCAAG CTAGCTCGTT GGTCGGCAGC GCCCTGACCC AATTTGTATT AATATGGTGG ATCACCCAAA CCGTCGGCAC GGCCCAAGCT TTATCGCTCG CTGGCATGAT GGCCTTGCTG CCACAAGCAG TCTTTGGGCC AATTGGCGGT ATCATCGCCG ACCGCTGGAA TCGCCGCCTG ATTATGATTA GCAGCGATCT GATTTCAGCC ATCAGTATGG TTATTTTGAT TGTGCTTTTT GCGACCGAGC AGATTCAGCT TTGGCATATC TACACGCTGA TGTTTTTGCG CAGCACGATG CAGGCCTTTC AAAGCCCTGC CGCGACGGCC AGCACCAGCC AACTTGTACC GCCCGATTGG CTAACGCGTG CAGCTGGCAT GAACCAGATT ATTTTGGGCT TGATGAGTGT GGCGGCGGCT CCACTTGGCG CATTGGCGAT GAGCTTATTT TCTCTTGAAG GCGCGTTGAT GATCGATGTG GTCACTGCGC TACTCGCAAT TACACCATTA TTGTTCTATA AAGTGCCCCA AACTCGTCAG GCTACCGAGG ATCAAGCCAG CATGTGGCAC GATTTTCGCA GCGGCTTCAG CATGATTCTG CACCATCGCG GCTTAACTTT GATGTATGGA CTAACTTTGT TGATGGTGGC GGTGTTGATT CCAACCTTCG TGCTGACTCC GTTGTTGATT CAGCAAGAAT TTGGCGGTGG AGTTGAGCGG GTTGCTTTGA TGGAAGGTAT GGGCGGTTTG GGCATGTTAA TCGGTGGTCT GATGATCAGC ATCATGCAGT TTTCCATGCG CCGAATTGTT TTGGTGTTGG TGATGTTTGC GCTCTCGTCA GCGATGGTTG GCTTGGCGGG GCTTGTGCCT AGCTCGCTGT TTTGGGTAGC AGTGGTTTTA TGGTTTATCA GTGGGGTAAC CTATACCATC GGCAATGCAC CAATTATTGC AATCGTCCAA ACAATTGTGC CCAATCAAAT GCAGGGCCGT GCCCTCTCAC TCTATTCAAC CATGATCGGC CTAGCTGGCC CGCTGACCTT GCCCCTGACC GGACCACTCA GCGAATTGAT CGGCATTCGC ATGATCTTTA TTGGCGGTGG TTTTATTGCC GCCTTGGTGT GCTGTTTGGC TTTTCTATCA CCCAGCATTT TACAAATTGA GCAAACGCCA ATCGCCACAC ATGACTAA
|
Protein sequence | MSLDTLKPTT TNPIDNWSHI VWKPRFFAIW LGQASSLVGS ALTQFVLIWW ITQTVGTAQA LSLAGMMALL PQAVFGPIGG IIADRWNRRL IMISSDLISA ISMVILIVLF ATEQIQLWHI YTLMFLRSTM QAFQSPAATA STSQLVPPDW LTRAAGMNQI ILGLMSVAAA PLGALAMSLF SLEGALMIDV VTALLAITPL LFYKVPQTRQ ATEDQASMWH DFRSGFSMIL HHRGLTLMYG LTLLMVAVLI PTFVLTPLLI QQEFGGGVER VALMEGMGGL GMLIGGLMIS IMQFSMRRIV LVLVMFALSS AMVGLAGLVP SSLFWVAVVL WFISGVTYTI GNAPIIAIVQ TIVPNQMQGR ALSLYSTMIG LAGPLTLPLT GPLSELIGIR MIFIGGGFIA ALVCCLAFLS PSILQIEQTP IATHD
|
| |