Gene Haur_0984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0984 
Symbol 
ID5732887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1126833 
End bp1128278 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content49% 
IMG OID641278118 
Productmajor facilitator transporter 
Protein accessionYP_001543760 
Protein GI159897513 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.829965 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAACA CACCCATGCC ATCACAGCAT TCCACCGTTG TCTCGCAATC CACAATTTGG 
AAGGTCATCA TTGCATCGGC TTTGGGTACC ATGATTGAAT GGTACGATTT CTACATTTTT
GGCAGTCTTG CTGCGGTCAT TGCGACAAAT TTCTATCAGT CGGGCAACGA AACGGCGGCC
TTGCTGGAAA CCTTTGCCAC CTTTGGGGCG GGCTTTGCGG CGCGACCGTT CGGGGCACTG
GTGTTTGGGC GCATTGGCGA TATTGTTGGG CGCAAATATG CCTTTTTGGT CACGTTGCTG
ATCATGGGCG GCGCAACCAC GGTCATTGGG ATTTTGCCCA CCTATGCCTC AATTGGCATC
CTCGCCCCGA TCATTTTGGT GATTATTCGG ATCATCCAAG GTTTGGCGCT TGGTGGTGAA
TATGGCGGTG CGGCGGTCTA TGTCGCTGAA CATGTTCCCG ACCATAAGCG CGGTTTTTAC
ACCAGTTTTA TTCAAATTAC CGCCACGCTT GGCTTATTTA TCTCGTTGCT GGTAATTTTG
ATTGTACGAA CCTCGATGAG CAAGGCAGCC TTTGATAGCT GGGGCTGGCG GATTCCCTTC
TTGCTCTCAA TTGTCTTGGT GGGTGTTTCA GTCTACATTC GCTCGAAGAT GAGTGAATCG
CCGTTGTTTA CCAAACTCAA ACATGCAGGC AAAACCTCGA AAGCTCCGCT TAAAGATAGT
TTTGGCAATC GGCGCAATTG GAAAGTGATT TTGACGGTGT TGTTTGGAGC CGCTGCGGGT
CAAGCAGTAA TCTGGTATAC CGCTCAATTT TACGTGAACT CGTGGCTCAA AACCCAAGCC
AAAGTGCCAG CTAACACCGT TGATACAATC GTGGCGATTG CTTTGTTCTT AGGCATGCCG
TTTTTCGTCG TCATGGGAGC GCTTTCAGAT AAATGGGGGC GCAAAACGGT GATGATGGCA
GGCAATTTAA TCGGTGCAAT TGCGATTTAT CCCGCCTTTA TGGCCCTGAA AGCGGCGGCT
GGTCCAATTA CTCCGGCGGT TCTCGATGAA GCTGGAAAGG TTATCACGCC TGCGGTCGCC
AACAATCCTA ACACCGTTCT ACTCACCTTG ATCATTTTTG GGTTGGTGTT GTGTGTTTGT
ATGGTGTATG GCCCGATTGC GGCCTTTTTG GTGGAATCGT TTCCTGCCAA AATTCGCTAT
ACCTCGGTTT CACTGCCCTA TCATGTTGGC AACGGCTACT TTGGCGGTTG GTTGCCCTTT
ATCGCCACAG CAGTGGTTAG TAGTACCGGC AATATCTATG CTGGCCTATG GTTTCCAATT
GCCATCGCTT TGTTGACCTT TGTGGTTGGG ATGGTCTTGC TCAAGGAAAC CAAGGATAAT
TCGCTGCATG AAGAGGCTAG CGATAACCCA ATGGCGACTG AAATGGATTT AATTGCCCAA
TCATAA
 
Protein sequence
MSNTPMPSQH STVVSQSTIW KVIIASALGT MIEWYDFYIF GSLAAVIATN FYQSGNETAA 
LLETFATFGA GFAARPFGAL VFGRIGDIVG RKYAFLVTLL IMGGATTVIG ILPTYASIGI
LAPIILVIIR IIQGLALGGE YGGAAVYVAE HVPDHKRGFY TSFIQITATL GLFISLLVIL
IVRTSMSKAA FDSWGWRIPF LLSIVLVGVS VYIRSKMSES PLFTKLKHAG KTSKAPLKDS
FGNRRNWKVI LTVLFGAAAG QAVIWYTAQF YVNSWLKTQA KVPANTVDTI VAIALFLGMP
FFVVMGALSD KWGRKTVMMA GNLIGAIAIY PAFMALKAAA GPITPAVLDE AGKVITPAVA
NNPNTVLLTL IIFGLVLCVC MVYGPIAAFL VESFPAKIRY TSVSLPYHVG NGYFGGWLPF
IATAVVSSTG NIYAGLWFPI AIALLTFVVG MVLLKETKDN SLHEEASDNP MATEMDLIAQ
S