Gene Haur_1243 details


Gene Information

Locus tag: Haur_1243
Symbol:
ID: 5733151
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1448923
End bp: 1450257
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 50%
IMG OID: 641278383
Product: extracellular solute-binding protein
Protein accession: YP_001544019
Protein GI: 159897772
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
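
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which is consistent with the stated length) and that the final codon of the CDS is a stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1448923, 1450257)
print(length)                     # 1335
print(protein_length_aa(length))  # 444
```

Both results agree with the Gene Length and Protein Length fields of this record.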


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCACAT CGAAGTCAAC GTTCCGACTC TCTTTTATGC TACTCTTGGT TTTGCTCACC 
AGCATCTTGG CGGCTTGCGG CTCTGAGACC GCTACCACTG CGCCAAGCGG GAGCACCACC
ACTAGCAACG AGCCACGCAC CATCAAACTT TGGCACTACG AAGGTGCTAA CAGCGCCATG
GGTATTGCTT GGGCCGAGTC AATCAAACAA TTTCAAGCAT CACACCCTGG CGTAACGATT
CAGTTTGAAG AAAAAGGCTT CGAGCAAATT CGCCAAACCG CTGGTATGGT GCTCAACTCC
GATGAAACTC CCGATATTTT GGAATACAAC AAAGGGAATG CAACCGCTGG TTTGCTTTCA
ACCCAAGGCT TGCTGACTGA TCTTTCCGAG GTGGCGACCC AACGCGGTTG GGATAAATTG
CTCAGCTCCA GCTTGCAAAC CACCGCCCGC TACGATGAAA AAGGCGTGAT GGGTGCTGGC
AAATGGTTTG GTGTGCCCAA CTATGCCGAA TATGTGATGG TTTATTACAA CAAAGACATG
TTCGCCAAAG CCAACTTGCA AGTGCCAACT ACCTTGGCCG AATTTGAAGC CGTCATGGAT
GCCTTTGTGC AACAAGGGGT CACGCCGCTC TCGGTCGGCG CTGCTGAATA TCCCGCCCAA
CAGATTTTCT ATGAATTGGT GCTGAGCCAA GCTGATCGCG AATTCGTCAA TGCCTTCCAA
CTCTATCAAG GCGATGTCGA TTTCCGTGGC CCTGAGTTTA CCTATGGCGC TGAAAAAATG
GCCGAATGGG TCAGCAAAGG CTATATCAGC AAAGATGCCA CCGGCATCAA AGCCGAAGAT
ATGGGCGTGG CCTTCACCAA TGGCACATTC CCAATCATGA TTTCGGGCAG TTGGTGGTAC
GGTCGCTTCA CCGACGAAAT CAAGGGCTTT GAATGGGGCA CCTTCTTGTT CCCAGGCAAT
AAATTGCACC CTGGCTCAAG CGGCAACATC TGGGCCGTGC CAACCAATGC CAAAAACAAA
GATCTGGTCT ACGATTTCAT CGATATCACG ATGAGCCAAG ATATTCAGAC CTTGTTGGGT
AATTCTGGTG GCGTGCCAGT TAACGCCGAC GTGAGCAAAA TCACCAACGA AAAGAACAAA
GAATTGATCC AAAACTTCGA TGCAATTTCC AAGGCCGATG GCTTAGCCTT CTACCCCGAC
TGGCCAGCCC CAGGCTACTA CGATGTTTTG GTTGCCAACG TTCAAGAGTT GATTGATGGA
ACCAAAACTC CCAGCGAAAT GCTCGATGCA ATCGCTATTC CATATCAAGA AAATCGGGCA
ACGTTAGGCA AATAA
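
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before its length or GC fraction can be computed. The following is a minimal sketch of that clean-up, shown on the first row of the sequence only. `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tooling.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence above (six blocks of ten bases).
first_row = "ATGTCCACAT CGAAGTCAAC GTTCCGACTC TCTTTTATGC TACTCTTGGT TTTGCTCACC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(gc_fraction(seq))
```

Applying the same two functions to the full listing would give values to compare against the Gene Length and GC content fields of the record.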
 
Protein sequence
MSTSKSTFRL SFMLLLVLLT SILAACGSET ATTAPSGSTT TSNEPRTIKL WHYEGANSAM 
GIAWAESIKQ FQASHPGVTI QFEEKGFEQI RQTAGMVLNS DETPDILEYN KGNATAGLLS
TQGLLTDLSE VATQRGWDKL LSSSLQTTAR YDEKGVMGAG KWFGVPNYAE YVMVYYNKDM
FAKANLQVPT TLAEFEAVMD AFVQQGVTPL SVGAAEYPAQ QIFYELVLSQ ADREFVNAFQ
LYQGDVDFRG PEFTYGAEKM AEWVSKGYIS KDATGIKAED MGVAFTNGTF PIMISGSWWY
GRFTDEIKGF EWGTFLFPGN KLHPGSSGNI WAVPTNAKNK DLVYDFIDIT MSQDIQTLLG
NSGGVPVNAD VSKITNEKNK ELIQNFDAIS KADGLAFYPD WPAPGYYDVL VANVQELIDG
TKTPSEMLDA IAIPYQENRA TLGK
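
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the codon assignments of the standard genetic code, which translation table 11 shares for elongation codons (table 11 differs in which codons may act as start codons; this gene begins with ATG, so that difference does not come into play here). `CODON_TABLE` and `translate` are hypothetical names introduced for illustration.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA listing codon by codon, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop spaces and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence above.
print(translate("ATGTCCACAT CGAAGTCAAC GTTCCGACTC TCTTTTATGC TACTCTTGGT TTTGCTCACC"))
# MSTSKSTFRLSFMLLLVLLT
print(translate("ACGTTAGGCA AATAA"))
# TLGK
```

The two outputs match the first twenty and last four residues of the protein sequence listed above.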