Gene Haur_3349 details

Gene Information

Locus tag: Haur_3349
Symbol:
ID: 5735219
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4223687
End bp: 4225012
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 50%
IMG OID: 641280496
Product: major facilitator transporter
Protein accession: YP_001546113
Protein GI: 159899866
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2211] Na+/melibiose symporter and related transporters
TIGRFAM ID:
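
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4223687
end_bp = 4225012

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1326
print(protein_length)  # 441
```

Both results agree with the Gene Length and Protein Length fields.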


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAATGGA AATTAGCACT TCATCGGCCT ATTGCTCCGC GTGCGAGCAA TGAAGAAACT 
TTGCGCCGTA ACATGCGCTT AGGAGTGGCC AATGGAGTTT TATTTATTTT AGCCGACGCA
TTTAGCGATG CGAATTTGGT TCTGACGGTA TTTGTACGCG AGCTGGGCGC TGCGCCGTGG
GTAGTTGGCT TGTTGCCATC GCTCAAATCG GGCGGCTGGC TGCTACCACA ATTGTTGAGT
GCTGGCCGTT TGCAAGGCAT GACCTACAAA TTGCCAGTCT ATCGCCAAGT CGGAATTGTG
CGCTTTTTTA TTTGGCTGGC GATGGTTTTG GTAGTCTGGA ATGCAACTAG TTTGCCAGTT
TGGGTGCTGT TGCCGCTGTT TTTGCTAGGC TATGCGCTCT ACAATTTTAC GGGTGGATCT
GGCTCGGTGG CTTTTCAGGA AGTTGTGGCC AAAACGATTC CTGCCCGTCA GCGCGGCAAA
TTTTTCGGAG CACGCAATTT AATCGGCGGT TTGCTCTCGT TTGCCTTGGT TAGCCCTTTG
GTTGGTTGGT TGCTGAGCCG TTCCAGTCCT TTGCTATTTC CCCACAATTA TGGGGTTTTG
CTGTTTATTT CGTTTGTGTT GATTGGCTTT GGGATTATTT CGTTTAGCCT ATTTGCCGAG
CCGCCGACAA CCAATCCGCC TGCGGCGATT TCGACCAAGC AGATGTTTGC GAAAATTCCG
GTGTTGCTCA AGAGTGACCG CAATTTTCGC CAATATGTGC TTTCGCGCAT GGTCACCCGT
TTGGGTGGCT TGGCTGACCC TTTTTATATT TTGTATGCCC GTGAAGTATT GAATGTGCCA
CCACGCATGA TTGGGGTCTA TTTGGCGGTA CGAGTGTTCT CGGCAGCACT ATCGAACCTC
TTTTGGTCGC GGGTTGGCGA TCAACGGGGC AATCGTTTGT TGATTGTCTT AACTGGTGCG
TTGATCATCA CCGTGCCAAC GTGGGCTTTG TTGGTGATGC CATTTGCCAG TATTTTGGGG
CCAGAAGCCT TGGGTTGGTT TTTTGGCGTA ATTTTCTTGT TGATCGGCCT AAGTGTCGAT
GGCTCGAACA CTGCTAGTTT AACCTATGTG ATGGAGTTAG CACCAGCCGA GCAACGTCCA
GTCTATGTCG GTGTTTGTAA TACCTTGATG GGCATCGCGA CCTTTTTTCC GGTGCTGGGT
GGGGTGTTAT TGGCCCAATT CGGCTATTTA CCCTTGTTTT GGATTAGTGC GGCCAGCGCC
TTTATTGGTT TGTTGCTCTC GCGCCGCTTG CCTGAGCCAC GTATCCACGA AGAACGTAGA
GCATAG
 
Protein sequence
MQWKLALHRP IAPRASNEET LRRNMRLGVA NGVLFILADA FSDANLVLTV FVRELGAAPW 
VVGLLPSLKS GGWLLPQLLS AGRLQGMTYK LPVYRQVGIV RFFIWLAMVL VVWNATSLPV
WVLLPLFLLG YALYNFTGGS GSVAFQEVVA KTIPARQRGK FFGARNLIGG LLSFALVSPL
VGWLLSRSSP LLFPHNYGVL LFISFVLIGF GIISFSLFAE PPTTNPPAAI STKQMFAKIP
VLLKSDRNFR QYVLSRMVTR LGGLADPFYI LYAREVLNVP PRMIGVYLAV RVFSAALSNL
FWSRVGDQRG NRLLIVLTGA LIITVPTWAL LVMPFASILG PEALGWFFGV IFLLIGLSVD
GSNTASLTYV MELAPAEQRP VYVGVCNTLM GIATFFPVLG GVLLAQFGYL PLFWISAASA
FIGLLLSRRL PEPRIHEERR A
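
The protein sequence corresponds to the gene sequence translated with translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code. The following is a minimal sketch of that translation; `translate` and `CODON_TABLE` are hypothetical names introduced for illustration, and only the first line and the last six bases of the gene sequence are used as input.

```python
from itertools import product

# Amino acids for the 64 codons, listed in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGCAATGGA AATTAGCACT TCATCGGCCT ATTGCTCCGC GTGCGAGCAA TGAAGAAACT"
print(translate(first_line))  # MQWKLALHRPIAPRASNEET

# Last six bases of the gene sequence: one alanine codon, then a stop codon.
print(translate("GCATAG"))  # A
```

The first output matches the first 20 residues of the protein sequence, and the second matches its final residue.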