Gene Information |
Locus tag | Haur_0796 |
Symbol | |
ID | 5732681 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 899040 |
End bp | 900215 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277927 |
Product | major facilitator transporter |
Protein accession | YP_001543572 |
Protein GI | 159897325 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
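The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 899040
end_bp = 900215

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the "Gene Length" field
print(protein_length, "aa")   # compare with the "Protein Length" field
```

Under those assumptions the computed values match the Gene Length (1176 bp) and Protein Length (391 aa) fields.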
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAACGCT CTCCACTTTT GACCATTTTT TTGATCGCGT TTGTTGGAAC CATGAGCTAT GGTGTGGTGA TTCCGATCAC GCCGTTTTAT GCGCAGGAGT TTGGGGCTTC CGAGGTTCAG GTGGGCATGA TTGTAGGCAG CTATGCCTTG ATGCAATTTA TCTTTGCCCC GATCCTTGGC CAATTATCCG ACCGCTATGG TCGTCGCCCA CTGCTGATTT TAAGTTTGAT TGGCACGGTT TGTAGTTTGT TGCTATTTGG TTTTGCCAAT AGCCTGATTT GGCTGTTCGT CGGGCGCATG TTCGATGGCG CAACTGGCGG TAACATCTCG ATTGCCCAAG CCTATGTTAG CGATATCACC ACCGACAAAG ATCGTGCTCG CGGGATGGGC ATGGTTGGGG CGGCACTTGG CTTAGGCTTT ATCGCTGGCC CAGCCATCGG CGCGTTGCTC AGCAAAGATG GCAATTATCA GTTGCCAATT TTCGTAGCCG CAGGCATTGC AGTGCTCAGC CTGATTTTAA CGATTGTGGT ATTGCCTGAG CCAGAGCGCC ATGCACCTCA ACAAGGCCGT ACTTTTAACC CAATGAAACT GCTGGCGGCA GTTCGCAAGC CCAATGTTGG CCGTTTGCTC AGTATTACCT TGTTGATCAA CTTGGCATTT GTGGCCTTTG AAACAACTTT TGCCTTGTTT GCGGCGCGAC GGTTGGAGTT TGGCTCGCAT CAAACAGGCT ATACTTTGGC CGGGGTTGGG ATTGTGGTCG CGATTGTGCA AGGCGGCTTA ATTCGCCGTT TGGCGGCGCG GTTTGGCGAA GCAACCCTGA TTGTGTCTGG CTCGTTGCTG CTCGCGCTTT CGTTGGCGGG CTTGGGCTTT ATTCAAAATG TGTGGCATTT GGTGGCAATT TGTATTGTGC TGGCAGTTGG CGAGGGCTTG CTCACGCCAT CGCTTTCGTC GTTGGTCAGC CGCAATTCAC CTGCTAGCGA GCGCGGCGAG AATATGGGCT TGTATCAGTC GATGAGCAGT TTGGCGCGGA TTTTTGCCCC GCTCTATGCC ACCTGGATGC TCTCGAACGT TGGCGAAGCC TCGCCCTACC TGATGGGCAG CGTGTTGGTT GTGGCAGGCG CATTAATTGC GGTTGGCTTG CCTAGCCCTG AACCGCAAGC CCAGCCAGCG CATTAG
Protein sequence | MKRSPLLTIF LIAFVGTMSY GVVIPITPFY AQEFGASEVQ VGMIVGSYAL MQFIFAPILG QLSDRYGRRP LLILSLIGTV CSLLLFGFAN SLIWLFVGRM FDGATGGNIS IAQAYVSDIT TDKDRARGMG MVGAALGLGF IAGPAIGALL SKDGNYQLPI FVAAGIAVLS LILTIVVLPE PERHAPQQGR TFNPMKLLAA VRKPNVGRLL SITLLINLAF VAFETTFALF AARRLEFGSH QTGYTLAGVG IVVAIVQGGL IRRLAARFGE ATLIVSGSLL LALSLAGLGF IQNVWHLVAI CIVLAVGEGL LTPSLSSLVS RNSPASERGE NMGLYQSMSS LARIFAPLYA TWMLSNVGEA SPYLMGSVLV VAGALIAVGL PSPEPQAQPA H
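Both sequences are displayed in space-separated blocks of ten characters, so the spaces have to be removed before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction helper; the function names `strip_blocks` and `gc_fraction` are hypothetical and not part of any database tool, and only the first three blocks of the gene sequence are used as sample input.

```python
def strip_blocks(seq: str) -> str:
    """Remove the whitespace separating display blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (spaces ignored)."""
    clean = strip_blocks(dna)
    if not clean:
        raise ValueError("empty sequence")
    return (clean.count("G") + clean.count("C")) / len(clean)


# First three display blocks of the gene sequence from the record above.
head = "ATGAAACGCT CTCCACTTTT GACCATTTTT"
clean_head = strip_blocks(head)

print(clean_head)
print(len(clean_head))
print(round(gc_fraction(head), 2))
```

Passing the full Gene sequence field to `gc_fraction` gives a value that can be compared with the GC content field, and `strip_blocks` applies equally to the Protein sequence field.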