Gene Haur_2125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2125 
Symbol 
ID5734013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2669021 
End bp2670544 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content53% 
IMG OID641279266 
Productmajor facilitator transporter 
Protein accessionYP_001544893 
Protein GI159898646 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGAAG CACGTCGCTG GCCATGGGGC TTAATTTGGA CAGCCCTGAT TATTTTTTTG 
GCCGCTCTCG ACCAAACGGT GGTGATCACC GTCTTACCAA ATGTGGTCAG CACGCTGGGC
CTTGATGTTG AGCAAGCGCT TGAACAAGGC ATTTGGGTCA TCACTGGCTA TTTGTTGGGC
TATACCGTCG CTATGCCCTT GCTAGGACGG ATCGCCGATG CCTATGGTCA TCGGCGCTTA
TTTTTGGCGG CGCTCGGGGT TTTCGTGGGC GGTTCAATTG GTTGTGCCTT GGCAGATAGC
GTTTGGTCGT TGGTGGCATG GCGCATCGTT CAGGCGATTG GTGGTGGCGC TGTGTTGCCG
ATCGGTTTGG CAATCTCGAT GGATGAAGTT AAGCCAATTC ATCATGCAAC TGCCTTGGGG
ATTATGGGGG CCGCCGGCGA AGCTGGCGGG GTGCTTGGCC CAGCCTATGG TGGCCTGATT
TCACAAATCC AACTGCTCGA TGTTGATGGT TGGCGTTGGG TTTTCTGGCT GAATATTCCA
CTCGGCGCAG CTTTGGCTTG GGCAATTATT CGCACCTTAC CTGATCGGCC TGGTAATCGC
GGGGCGATTG ATTATATTGG CGGTGGCTTG ATTGCGGTTA GTTTGACCGC CTTAACTGTG
GCGCTTTCAC GCTCGCTCGG TAGTTTGGCC ATCGAGCCCA GCGCCGAAAG CGGCAACCTC
GATCAATATG CAGTGCAATG GACATCACCA TTAACCATTG GCCTGTTGGT GTTGGCAGTG
CTCAGTTTTA TTGGCTTTAT CTGGTGGGAA CGCCGCACGA CAACTCCATT GATCGAGCTA
AGCGCCTTCC GCACCCCAGC ATTTAGCGCT GCCAATATTA CCAATGTCTT GGTTGGCATG
GCCTTAATTG TGGGCATGGT CAATGTGCCG TTTTTCGTCG GAACAGTGTT GGCAGGCGAT
GCGCTTTCTG GTGGATTAAC GTTAATGCGC CTGACCATGA TGATCCCGAT TGGTGCAGTT
TTGGGTGGTA TGTTGATGCG CAAGATTAGT GCGCGGTTGG TTGCTAGCCT CGGCATGATC
ACCACCGCCG TTGGTTTTGC ATTGCTGGGC TTTTGGAAAG CAGAAACCAA TCAATTCCAA
TTAACCATCT ATTTATTGTT AACGGGTACA GGCTTTGGTT TAGTACTCCC CGCCTTGAGT
GCTGCCGCCA TTGGCACCGT TGCCCGTGAA TCAATGGGCA CTGCCGCAGG CTTATTGAAT
GCATTACGCA TGGTCGGAAT CACCTTGGGT GTTTCGGCTT TGGCTTCATG GAGTTTGGCC
TACCGCGCTA GCCTCAATAG TACGTTGGTG TTTACAATGG AAGATTTCAA TACAGGCGCG
GCCCAATTGG CCCTAACCCA AAATGAAATG ACGGTGTATC ACAGCACCTT TTTCGCCGCC
GCAGTAGTTT GCTTAATTGC GTTGATTCCA ATTTGGTGGC TGCCACGCGA ACGCAGCGAG
GGCGACACAC CACTGTTTGC CTAG
 
Protein sequence
MTEARRWPWG LIWTALIIFL AALDQTVVIT VLPNVVSTLG LDVEQALEQG IWVITGYLLG 
YTVAMPLLGR IADAYGHRRL FLAALGVFVG GSIGCALADS VWSLVAWRIV QAIGGGAVLP
IGLAISMDEV KPIHHATALG IMGAAGEAGG VLGPAYGGLI SQIQLLDVDG WRWVFWLNIP
LGAALAWAII RTLPDRPGNR GAIDYIGGGL IAVSLTALTV ALSRSLGSLA IEPSAESGNL
DQYAVQWTSP LTIGLLVLAV LSFIGFIWWE RRTTTPLIEL SAFRTPAFSA ANITNVLVGM
ALIVGMVNVP FFVGTVLAGD ALSGGLTLMR LTMMIPIGAV LGGMLMRKIS ARLVASLGMI
TTAVGFALLG FWKAETNQFQ LTIYLLLTGT GFGLVLPALS AAAIGTVARE SMGTAAGLLN
ALRMVGITLG VSALASWSLA YRASLNSTLV FTMEDFNTGA AQLALTQNEM TVYHSTFFAA
AVVCLIALIP IWWLPRERSE GDTPLFA