Gene Haur_1139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1139 
Symbol 
ID5733031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1302340 
End bp1303527 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content51% 
IMG OID641278278 
Productmajor facilitator transporter 
Protein accessionYP_001543915 
Protein GI159897668 
COG category 
COG ID 
TIGRFAM ID[TIGR00882] oligosaccharide:H+ symporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000165449 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCAGCC GTTTGGCGCT CTGGCGGTCG AATGCACGTC CCGGGGTTCG GGGTAGTTTG 
TACTATTTAT GTTTTTGGTC AAGCGTTGGG ATGTATATTC CATTCATTAA TGTATATTTC
ACCAACCTTG GCTTAAGTGG GCAACAAATA GCCATGTTTG GGGCAATTAG CCCGCTAGCA
GTGCTGTTGT TCAACCCATT GGTGGGTGCA ACAGCCGATC GACGTGGCTG GCATGTGCAG
TTACTCCTAA GCATGTTGGC CTTAACGGGG TTAAGCCAAA TTGCCTTGGC ATTTCCAACA
ACCTTCTTTA CAATCTTGCC AGTGATGGTC GTTTTGGCAG TGGTACGCGG GCCAATTGCG
CCATTGGCTG ATAGCATGAT CGCAGGCATG GCCGTGCGTC ATCAACTGGC CTATGGCAAA
TTGCGGCTCT GGGGTTCAGT TGGCTATGCC GTCACCTCAT TATTAGGTGG TATTTGGTGG
GCCAAAACGG GCTATCCAAC CATGTTTATA TTGACTGGCT TGATGACTGG CTTGGTCGCG
ATCGTAGCCA ATAGCCTTGA TCATACGCCT GAATTGCGCA AAACTGCTGC CAAATCGGCG
AAAGCGCCCC GTGATGCCGC TTTTATCGCA ATTGTCGTGA TCACTAGTTT GGTTGGGGCT
GCCTTTAGCA TGGTTTCAAT GTTCGATGGC AACCTGATTC AACGGATCAG TGGGAGCACC
ATCATGCTGG GGGTTTTGCC ATGTGTCATC GCCAGTACCG AAGTGCCAGT GATGCTCAAC
GCCGATCGGG TGATCGCTCG CTTTGGCACA GCTAAAACAC TAGCCGTTTC AACCTTGATT
CTTGGGCTAG GGTTTATTGG CAGCGGCATG GTGAGCGAAG CTTGGATGTT AATTCCAATT
GGCATGTTTC GGGCTTGTGG ATTTGGCTTG TACTCGGTGG CGATCATTCG CCTAATTACC
GAGCGCATTC CAACCACCTT GCTGGCAACC GCGCAAGGCT TGATCAGTGC GATTGCTGGT
GGTTTGTCGC CGTTGTTGGC TACCCAAGCG GGTGGCTATA TGTTCGATAT ATCAGGGCCA
CAATTGGTCT TTATTGCATC AGGCTTGTGC ATTGGCTTGG CAACCTTGGT CGTTTGGCTA
GGCTTAAAAC TGAATTGGTT CAAACCAATC GCGCAAACCA ACGCCTAA
 
Protein sequence
MGSRLALWRS NARPGVRGSL YYLCFWSSVG MYIPFINVYF TNLGLSGQQI AMFGAISPLA 
VLLFNPLVGA TADRRGWHVQ LLLSMLALTG LSQIALAFPT TFFTILPVMV VLAVVRGPIA
PLADSMIAGM AVRHQLAYGK LRLWGSVGYA VTSLLGGIWW AKTGYPTMFI LTGLMTGLVA
IVANSLDHTP ELRKTAAKSA KAPRDAAFIA IVVITSLVGA AFSMVSMFDG NLIQRISGST
IMLGVLPCVI ASTEVPVMLN ADRVIARFGT AKTLAVSTLI LGLGFIGSGM VSEAWMLIPI
GMFRACGFGL YSVAIIRLIT ERIPTTLLAT AQGLISAIAG GLSPLLATQA GGYMFDISGP
QLVFIASGLC IGLATLVVWL GLKLNWFKPI AQTNA