Gene Haur_1150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1150 
Symbol 
ID5733043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1320383 
End bp1321708 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content52% 
IMG OID641278290 
Productextracellular solute-binding protein 
Protein accessionYP_001543926 
Protein GI159897679 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.344094 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACGAC CATGGATTCG CTCGTTTACC TTGCTGATCG GCTTAATTTT GGCGGCATGT 
GGCGAGGCGA CTACGCCAAC CACGCCCCCG ACCAACCCAA CTACGGCAAC TGGCGCTAGC
AGTGCGGCTA GTGGCACGGT CACGTTGTGG TTTCACTCCG GTCAAGGTGC TGAACGCGAT
GCCTTAAACG CAACGCTCCA AGCATTTGCA GCCAAAAACT CAGCAATCAA AGTTGAAGCC
ATTGAATTGC CTGAAGGCGC ATATAACGAT CAAGTCAATG CTGCTGCCTT GGCTGGCGAA
TTGCCCTGCT TGCTCGATTT TGATGGCCCA TTTGTCTATA ACTATGCATG GTCGGGCTAT
TTGCAACCAT TGGATAGTTT GATCGCTGCC GATGTCAAAG CCGATTTTCT GCCTTCAATC
ATTGAACAAG GCACTTACAA CGGCAAATTG TATAGCCTTG GGCAGTTCGA TTCGGGCTTA
GGCTTCTATG CCAACAAGGA ATTGTTGGAA AAAGCTGGGG TGCGCATTCC AACTTTAGCC
CAGCCATGGA CTCGCGCTGA GCTTGATGAG GCCTTGAGCG AACTTAAAGC CAATGGTTTG
GAATATCCAC TTGACTTGAA AATGGACTAT GGCCGTGGCG AGTGGTTTAG TTATGGCTTT
TCACCCTTCT TGCAATCTTT TGGCGGCGAT TTGATCGATC GCTCAACGTA TCAAAAAGCC
AGCGGCAGCT TGAATAGCGC GGCTTCGGTC GAAGCAATGA AGTGGTTCCA AGGCCTCTTC
ACCAATGGCT ATGTTAATCC TAAGCCTGCT GGCAGCACCG ATTTTGCTGA GGGTAAAGCG
GCTTTGAGTT GGGTTGGGCA CTGGGCCTAC CCTGATTATG CCAAAGCCTT GGGCGATAAA
TTGTTGGTGC TGCCTGCCGC CGATTTGGGC AAGGGTGCGA AAACGGGCAT GGGTTCGTGG
AATTGGGGCA TTACCAGCAA GTGTGCTAAT CCGGCGGCTG CCGCTGAAGT GCTTTCGTTC
ATCGTCTCGC CCGAAGAAGT GCTGCGCATG AGCGATGCCA ATGGCGCTGT GCCAGCGCGT
ACTTCAGCAA TTGCCAAATC CAAATTGTTT GGTGATGGCG CTCCGTTGAA CCTCTATGTG
CAACAATTGA CCAATGGCGT TGCCATGCCG CGCCCAATTA CCCCAGCCTA CCCAGTCATT
ACCGTCGCCT TTGCCGAAGC CGTCGATAAC ATTGTGGCTG GAGCCGATGT GCAAGCCGAG
TTGGATAAAG CGGCCCAAAA GATCGATGCC GATATTGAAG ATAATCAAGG CTATCCCGTG
AAGTAA
 
Protein sequence
MQRPWIRSFT LLIGLILAAC GEATTPTTPP TNPTTATGAS SAASGTVTLW FHSGQGAERD 
ALNATLQAFA AKNSAIKVEA IELPEGAYND QVNAAALAGE LPCLLDFDGP FVYNYAWSGY
LQPLDSLIAA DVKADFLPSI IEQGTYNGKL YSLGQFDSGL GFYANKELLE KAGVRIPTLA
QPWTRAELDE ALSELKANGL EYPLDLKMDY GRGEWFSYGF SPFLQSFGGD LIDRSTYQKA
SGSLNSAASV EAMKWFQGLF TNGYVNPKPA GSTDFAEGKA ALSWVGHWAY PDYAKALGDK
LLVLPAADLG KGAKTGMGSW NWGITSKCAN PAAAAEVLSF IVSPEEVLRM SDANGAVPAR
TSAIAKSKLF GDGAPLNLYV QQLTNGVAMP RPITPAYPVI TVAFAEAVDN IVAGADVQAE
LDKAAQKIDA DIEDNQGYPV K