Gene Haur_4033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4033 
Symbol 
ID5735895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5148672 
End bp5149961 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content50% 
IMG OID641281184 
Productextracellular solute-binding protein 
Protein accessionYP_001546793 
Protein GI159900546 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTCGCA AATGGCGTTG GAAATTCGCT TTGGTTAGCC TATTAACGAC CGTTTTAGCC 
GCGTGTGGTG GCGGCAGCGC AACCCAAGCT CCTACCACCA ATCCCAATCG GGTCACGCTG
ACGATTGAAA GCTGGCGCAG CGACGACCAA AAGGTCTGGA CTGAGCAGAT TATCCCGCTA
TTTGAAAAGA GCCATCCTGA TATTCATGTG GAGTTTCGGC CCACCGCGCC GACCGAATAC
AACGCAGCGC TCAATAGCAA ACTAGAGGCT GGCACTGCTG GCGATCTGAT CACCTGCCGG
CCTTTTGATG TTTCATTAGG CTTGTATACT GCGGGGCATT TGACTGCACT CGACGATTTG
CCAGGCATTG CCCATTTTAG CGATGTAGCC AAGAGCGCTT GGCAAACCGA CGATGGCAAG
ACGCTCTTTT GCTTGCCAAT GGCCTCGGTG ATCCATGGCT TTATCTACAA CAAAGCGATT
TTTAACGAGC TAGGTTTGAC TGAACCCAAA ACTGTCGATG AGTTTTTCGC GGTGCTCGAT
AAAATTAAGC AAGATGGCCG CTATACACCA TTGGTGATGG GTACTGCCGA GCAGTGGGAA
GCAGCAACCA TGGGCTTCCA AAACATCGGG CCAAATTATT GGTTTGGTGA AAATGGCCGT
AAAGCGCTAA TCGATGGCAG CGCCAAATTA TCCGATGCGG CCTATGTTTC GACCTTAAAA
GATTTGGCAC GCTGGGGCGA TTATTTGCCC GATGGCTTTG AAGCAGTCAA ATATGCCGAT
GCTCAGATGT TGTTTGCCTC AGGCAAAGGG GCAATTTACC CCGCTGGCTC GTGGGATATT
AGCTATTTCA ACCAAAATGC CAAATTCGAG CTTGGCGCAT TCAAAGCTCC GGTTGCCAAA
GCCAATGATC AATGCTTCAT CAGCGACCAT ACCGATATTG GAATTGGGAT TAATGCTAAG
ACTGCCCACA TGGCTCAAGC CCGTACCTTC CTTGAGTGGA TGACTGGGCC AGAATTTGCT
GGCGAATTCA GCAATGCCTT GCCTGGCTTC TTCAGCCTCT CGGATTATCC AGTGAGCTTG
AAAAATCCCT TGGCCCAAAC CTTTGTTGAT TGGCGCAAAC AGTGTCAATC GACAATCCGC
AATTCCTACC AAATTCTCTC ACGCGGCGAG CCAAACCTCG AAAACGAACT CTGGCGGATC
AGCGTTGGGG TGATTGATGG AACATTTGCG CCTGAAGATG CGGCCCAAGA TCTGCAAAAG
GGCTTAGATA ATTGGTACAA AAAGCCCTAG
 
Protein sequence
MVRKWRWKFA LVSLLTTVLA ACGGGSATQA PTTNPNRVTL TIESWRSDDQ KVWTEQIIPL 
FEKSHPDIHV EFRPTAPTEY NAALNSKLEA GTAGDLITCR PFDVSLGLYT AGHLTALDDL
PGIAHFSDVA KSAWQTDDGK TLFCLPMASV IHGFIYNKAI FNELGLTEPK TVDEFFAVLD
KIKQDGRYTP LVMGTAEQWE AATMGFQNIG PNYWFGENGR KALIDGSAKL SDAAYVSTLK
DLARWGDYLP DGFEAVKYAD AQMLFASGKG AIYPAGSWDI SYFNQNAKFE LGAFKAPVAK
ANDQCFISDH TDIGIGINAK TAHMAQARTF LEWMTGPEFA GEFSNALPGF FSLSDYPVSL
KNPLAQTFVD WRKQCQSTIR NSYQILSRGE PNLENELWRI SVGVIDGTFA PEDAAQDLQK
GLDNWYKKP