Gene Haur_2594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2594 
Symbol 
ID5734472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3329024 
End bp3330211 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content52% 
IMG OID641279734 
Productmajor facilitator transporter 
Protein accessionYP_001545360 
Protein GI159899113 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACGC GGACAAGTGG GCAGGCTGTC GCTCTGACAG GCCTTTCGGT CATGGTAATG 
ATGACATTTT CGCATGCCAT GAATGATATG TGGACTTCAC TGTTAGCGCC CTTACTGCCA
AGTATTCGCG ATACCTATCA GGTGAGTATT GGTCAAACTG GCATTTTGGT GGCGATTTTG
TCGTTTGCTG GCTCGATGCT TCAGCCCTTG CTTGGTGCGG TTGGCGATTA TATTGATCGG
CGTTGGTTAG CGGCATTTGG CCCTGTGCTG ACGGCGATCG GCCTAACCTT GATTGGCTAT
GTGCCCAATT TCTTTATGCT GGGCGCGTTG ATTATGCTTG GTGGTTTGGG CAGCGCAATT
TTTCATCCGG CAGGAGCAGC CTATATCGCC ATGGGCGCGA ATCCTCAGCA ACGTGGTTTG
TTTGTTTCAA TTTTTTCGGC TGGCGGCACG GTTGGCATGG CCTTTGGCCC CCTAATTGCT
GCCCAGTTTG ATTTGGTGAG TTTGCCCTAT TTGCTGCCCG TGGGAATTGC AGTTGGGGTT
TTGACCTTCT TGATGATTCC TTCAGCCAAG CAAAATCGCA GCCAACCCAA AACGTTGCGC
GATTATATCA GCGTTTTTCA GGGGCCGTTG CGCTGGCTTT GGTTTATGAG CGTCTTACGC
TCGCTTTCGA GTGTTTCATA TAGCAGCTTA TTGGGCTTTA TGCTGCGCGA TCGTTTTGAT
CAAGCGATGG CTGATGCCCA TGTTGGCCCG ACGTTGGCGG TTTTCAATAT TGCCTCAGCG
GTTGGCGGCA TTATTGGCGG ACGCATTTCT GATCGAATTG GGCGCACAGT GGTGCTGCGT
TCAAGTATTT TGAGCACAAT TCCGCTTTTT ATCGGCTTAG TGCTATCATC GCCATTGAAT
TGGTGGTATT ACCCCTTGAC GGCACTGGTT GGGGCAATGG TGATGGCTAA TATTCCGGTT
TCGATTGTCA CAGCGCAGGA GTATGCACCG CAACATATTG CCACCGCCAG CGCCATGATG
ATGGGCTTTG CTTGGGGTAC GTCGGGCGTG CTTTACCCCA TCATTGGCAG CCTCGCCGAC
TGGACCTCGC CAACCTGGGC CATGATCGCC GCGATTGGCT TGTTATTGCC AGCCTTCTTT
ATCACGGTAC GGCTGCCCGA GCCTGAGCGC ACAACGACGA TAGGGTAG
 
Protein sequence
MATRTSGQAV ALTGLSVMVM MTFSHAMNDM WTSLLAPLLP SIRDTYQVSI GQTGILVAIL 
SFAGSMLQPL LGAVGDYIDR RWLAAFGPVL TAIGLTLIGY VPNFFMLGAL IMLGGLGSAI
FHPAGAAYIA MGANPQQRGL FVSIFSAGGT VGMAFGPLIA AQFDLVSLPY LLPVGIAVGV
LTFLMIPSAK QNRSQPKTLR DYISVFQGPL RWLWFMSVLR SLSSVSYSSL LGFMLRDRFD
QAMADAHVGP TLAVFNIASA VGGIIGGRIS DRIGRTVVLR SSILSTIPLF IGLVLSSPLN
WWYYPLTALV GAMVMANIPV SIVTAQEYAP QHIATASAMM MGFAWGTSGV LYPIIGSLAD
WTSPTWAMIA AIGLLLPAFF ITVRLPEPER TTTIG