Gene Haur_3936 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3936 
Symbol 
ID5735797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4929953 
End bp4931269 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content52% 
IMG OID641281087 
Productmajor facilitator transporter 
Protein accessionYP_001546698 
Protein GI159900451 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTTTTG CAAGCAAAGC TAGGCAGGTT ATGGAGCAGC AGGACATCAA CGCCAACGTG 
CGGCATAATG TGGTCGTGAA TGTGGCCGAT GGAGCATTTT TTGGGGCAGC CACGGGCATC
GCCTCGTTTG TGACGGTTAT TCCGCTATTT ATGCATACCT TGACCGATTC GGCCACCTTG
ATTGGCTTGG TTTCGGCAAT TCACTCGGTT GGTTGGCAGT TGCCCCAATT ATTAACGGCG
CGGCGGGTTG CCTCGCTACG TCGCTACAAA CCCATGGTGC TGCTGATGAC AATCAACGAG
CGCTTGCCAT TTTTTGGCTT GGCCCTGATT GCCTGGTTTG CTGCTGATTT GGGCCGCGAA
TTAGCCTTGT GGCTAGCATA TATGTTGTTG ATTTGGCAGG GCTTGGGTGG TGGCTTAACT
GCGACTGCTT GGCAAACCAT GATCGGCAAA ATTATGCCAC ATCGCTGGCG TGGCACATTC
TTTGGGGTCC AATCCGCCGC CGCCAATTTA CTCGCCAGCA TTGGCGCGGT TGCGGCGGGC
GTAATTTTGG ATAAACTGCC CTCGCCGATT GATTTTGTGG TCTGTTTTGG GGTTTCAGGT
GTGGCGATGT TTATTTCGTG GTCGTTTTTA GCTTGGACTA AAGAACCAAG CCGCGAGCCA
GAACATACTC ATGGCAGCCA ACGCGATTTT TGGGCCAGCA TTGCCAATAT TCTCAAGCGC
GATCGTAATT TTCGCTGGTT TTTGGCTTCG CGCATTATCT CGCAGCTTGG CTTGATGGCA
ACCGCTTTTT TTACAGTGTA TGCGGTGCAA CGCTTTGGGC TTGATGATCA GACCGCTGGA
ATTATGACCG CACTCTATTT AATCATCCAA ACTGTCGCCA ACCCAATCAT GGGCTGGCTC
GGCGATCGGA TTGGCTATCG GCGGGTGATG GAAGTAGGCG CACTGTTGGC CTTAGGTGCT
GGCCTAGGTG CGTGGCTCGC GCCAGCTTTG GGTTGGTTTT ATCTGATTTT CGCGATTGCG
GGAGTTGCCA ACGTCGCCTT TTGGACGATG GCTATGGCTA TGACCTTGGA ATTTGGCAGC
CTTGCTGAAC GCCCCAGCTA CATTGGTTTA GCCAATACTT TGACCGCTCC AGCAACTTTA
GTCGCACCGT TGATTGGCGG TTGGTTAGCC GATTCAGCAG GCTATAATTA TACGTTTGCG
GTGGCGGCGG CTGGTGGTTT GCTAACGTGG CTAATTCTAC GCTTCGCTGT ACGCGATCCA
CAATCTGTCA CGTATAGCGA AGCTCATGCC GAGCCACAAG TAGTTGTGGT TTCGTAA
 
Protein sequence
MVFASKARQV MEQQDINANV RHNVVVNVAD GAFFGAATGI ASFVTVIPLF MHTLTDSATL 
IGLVSAIHSV GWQLPQLLTA RRVASLRRYK PMVLLMTINE RLPFFGLALI AWFAADLGRE
LALWLAYMLL IWQGLGGGLT ATAWQTMIGK IMPHRWRGTF FGVQSAAANL LASIGAVAAG
VILDKLPSPI DFVVCFGVSG VAMFISWSFL AWTKEPSREP EHTHGSQRDF WASIANILKR
DRNFRWFLAS RIISQLGLMA TAFFTVYAVQ RFGLDDQTAG IMTALYLIIQ TVANPIMGWL
GDRIGYRRVM EVGALLALGA GLGAWLAPAL GWFYLIFAIA GVANVAFWTM AMAMTLEFGS
LAERPSYIGL ANTLTAPATL VAPLIGGWLA DSAGYNYTFA VAAAGGLLTW LILRFAVRDP
QSVTYSEAHA EPQVVVVS