Gene Haur_3332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3332 
Symbol 
ID5735202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4200736 
End bp4201971 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content51% 
IMG OID641280479 
Productmajor facilitator transporter 
Protein accessionYP_001546096 
Protein GI159899849 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCAAC AATTCAAAGA GTATGCGATT GTTACCGCCG CTTATTGGGC CTTTACATTG 
ACCGATGGCG CGTTGCGTAT GTTGGTCTTG TTTCATTTTC ATCAACTTGG CTATAGTGCC
TTGGCGGTGG CGGCGCTCTT TATTTTCTAT GAAGTATTTG GGGTGATTAC CAATTTGTTT
GGTGGCTGGC TGGGCGCACG CTTTGGCCTG AATCGCACCT TACATGCTGG CTTGGTGTTA
CAAGTTATCG CTTTGGGCAT GATGGTTGTG CCCAGTGCTT GGCTCAGCGT GACATATGTG
ATGCTAGCGC AAGCTGGCTC GGGCATCGCC AAAGATCTGA CCAAAATGAG TGCCAAGAGC
AGCATCAAAA TGTTGGTGGC TGAAGATGCT CAATCAACTT TGTTCAAATG GGTAGCTATT
TTGACTGGCT CGAAAAATGC GCTCAAAGGT GTGGGCTTTT TCTTGGGCAG CCTATTGCTG
AGCACCATTG GCTTTCGCAG TGCCTTCGCT GTGCTTGCGA GTTTGATTGC CGTAGCTGCC
ATTGGCACAT TCAGCTTTTT GCAACACGAT TTGGGGCGTA GCAAAGCCAA GCCGAAGTGG
CAACAACTAT TTGCCAAAAG CCGCGCGATC AATCTGCTTT CAGCAGCGCG TTGTTTTTTG
TTTGCAGCGC GGGATGTCTG GTTTGTGGTC GGCTTGCCAG TTTTTCTGAG TGAAGTGCTT
GGCTGGTCGT TTTGGCAGGC TGGCGGCTAT TTGGCCTTAT GGGTGATTGG CTATGGCCTG
ATTCAAGCGC TTGCGCCAGC GATTACCCGC CGCTGGCAAG CCACGCCCAA CGGCTGGGCT
GCAACAATTT TGGCTGGCTT GTTGATGCTG ATTATGGCTG CGATTGCCAG TGTTGTGCAA
ATCAACCAAC AATCGGCTAT CGTGGTTGTG GTCGGCTTAA TTGGCTTTGG ATTTATTTTT
GCGCTGAATT CGGCAGTCCA CTCCTACCTG ATTTTAGCCT ACACTGACCA AGCCGATGTG
GCTTTGAATG TAGGTTTTTA CTATATGGCC AACGCCTTGG GCCGCCTGCT TGGCACAATT
CTTTCGGGTC TGCTGTATCA AAGCTATGGG TTGGCAGGCT GTTTATGGGC AGCGGCCTTA
CTGAGCGCAA TCACAGCCGT GATCTCGTTG AGCTTGCCGC GTGTGCCAAA CCAACCGCTG
TTGGGCGAAC AGCAGGCCAA AGCTAGCTCG ATCTGA
 
Protein sequence
MQQQFKEYAI VTAAYWAFTL TDGALRMLVL FHFHQLGYSA LAVAALFIFY EVFGVITNLF 
GGWLGARFGL NRTLHAGLVL QVIALGMMVV PSAWLSVTYV MLAQAGSGIA KDLTKMSAKS
SIKMLVAEDA QSTLFKWVAI LTGSKNALKG VGFFLGSLLL STIGFRSAFA VLASLIAVAA
IGTFSFLQHD LGRSKAKPKW QQLFAKSRAI NLLSAARCFL FAARDVWFVV GLPVFLSEVL
GWSFWQAGGY LALWVIGYGL IQALAPAITR RWQATPNGWA ATILAGLLML IMAAIASVVQ
INQQSAIVVV VGLIGFGFIF ALNSAVHSYL ILAYTDQADV ALNVGFYYMA NALGRLLGTI
LSGLLYQSYG LAGCLWAAAL LSAITAVISL SLPRVPNQPL LGEQQAKASS I