Gene Haur_2001 details


Gene Information

Locus tag: Haur_2001
Symbol:
ID: 5733890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2480071
End bp: 2481348
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 51%
IMG OID: 641279145
Product: major facilitator transporter
Protein accession: YP_001544772
Protein GI: 159898525
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
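
The length fields in this record are consistent with the start and end coordinates. The short sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2480071, 2481348)
print(length)                  # 1278
print(protein_length(length))  # 425
```

Both values agree with the Gene Length and Protein Length fields listed for Haur_2001.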


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAGCCTTG ATACGCTCAA ACCCACAACG ACCAACCCAA TCGATAACTG GAGCCACATT 
GTTTGGAAAC CACGCTTTTT TGCCATTTGG CTGGGTCAAG CTAGCTCGTT GGTCGGCAGC
GCCCTGACCC AATTTGTATT AATATGGTGG ATCACCCAAA CCGTCGGCAC GGCCCAAGCT
TTATCGCTCG CTGGCATGAT GGCCTTGCTG CCACAAGCAG TCTTTGGGCC AATTGGCGGT
ATCATCGCCG ACCGCTGGAA TCGCCGCCTG ATTATGATTA GCAGCGATCT GATTTCAGCC
ATCAGTATGG TTATTTTGAT TGTGCTTTTT GCGACCGAGC AGATTCAGCT TTGGCATATC
TACACGCTGA TGTTTTTGCG CAGCACGATG CAGGCCTTTC AAAGCCCTGC CGCGACGGCC
AGCACCAGCC AACTTGTACC GCCCGATTGG CTAACGCGTG CAGCTGGCAT GAACCAGATT
ATTTTGGGCT TGATGAGTGT GGCGGCGGCT CCACTTGGCG CATTGGCGAT GAGCTTATTT
TCTCTTGAAG GCGCGTTGAT GATCGATGTG GTCACTGCGC TACTCGCAAT TACACCATTA
TTGTTCTATA AAGTGCCCCA AACTCGTCAG GCTACCGAGG ATCAAGCCAG CATGTGGCAC
GATTTTCGCA GCGGCTTCAG CATGATTCTG CACCATCGCG GCTTAACTTT GATGTATGGA
CTAACTTTGT TGATGGTGGC GGTGTTGATT CCAACCTTCG TGCTGACTCC GTTGTTGATT
CAGCAAGAAT TTGGCGGTGG AGTTGAGCGG GTTGCTTTGA TGGAAGGTAT GGGCGGTTTG
GGCATGTTAA TCGGTGGTCT GATGATCAGC ATCATGCAGT TTTCCATGCG CCGAATTGTT
TTGGTGTTGG TGATGTTTGC GCTCTCGTCA GCGATGGTTG GCTTGGCGGG GCTTGTGCCT
AGCTCGCTGT TTTGGGTAGC AGTGGTTTTA TGGTTTATCA GTGGGGTAAC CTATACCATC
GGCAATGCAC CAATTATTGC AATCGTCCAA ACAATTGTGC CCAATCAAAT GCAGGGCCGT
GCCCTCTCAC TCTATTCAAC CATGATCGGC CTAGCTGGCC CGCTGACCTT GCCCCTGACC
GGACCACTCA GCGAATTGAT CGGCATTCGC ATGATCTTTA TTGGCGGTGG TTTTATTGCC
GCCTTGGTGT GCTGTTTGGC TTTTCTATCA CCCAGCATTT TACAAATTGA GCAAACGCCA
ATCGCCACAC ATGACTAA
 
Protein sequence
MSLDTLKPTT TNPIDNWSHI VWKPRFFAIW LGQASSLVGS ALTQFVLIWW ITQTVGTAQA 
LSLAGMMALL PQAVFGPIGG IIADRWNRRL IMISSDLISA ISMVILIVLF ATEQIQLWHI
YTLMFLRSTM QAFQSPAATA STSQLVPPDW LTRAAGMNQI ILGLMSVAAA PLGALAMSLF
SLEGALMIDV VTALLAITPL LFYKVPQTRQ ATEDQASMWH DFRSGFSMIL HHRGLTLMYG
LTLLMVAVLI PTFVLTPLLI QQEFGGGVER VALMEGMGGL GMLIGGLMIS IMQFSMRRIV
LVLVMFALSS AMVGLAGLVP SSLFWVAVVL WFISGVTYTI GNAPIIAIVQ TIVPNQMQGR
ALSLYSTMIG LAGPLTLPLT GPLSELIGIR MIFIGGGFIA ALVCCLAFLS PSILQIEQTP
IATHD
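
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the tool used to produce this record. It assumes the blocked layout (10 bases per group) only needs whitespace removed, and it relies on translation table 11 assigning the same amino acids to codons as the standard code. Handling of alternative start codons, which table 11 permits, is left out because this gene begins with ATG. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino-acid assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence listed above.
first_line = "ATGAGCCTTG ATACGCTCAA ACCCACAACG ACCAACCCAA TCGATAACTG GAGCCACATT"
print(translate(first_line))  # MSLDTLKPTTTNPIDNWSHI
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison against the listed protein sequence and GC content.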