Gene Haur_4563 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4563 
Symbol 
ID5736408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5840220 
End bp5841539 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content52% 
IMG OID641281725 
Productextracellular solute-binding protein 
Protein accessionYP_001547322 
Protein GI159901075 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGTA TTTTGAGCTT CCTGCTCGTC AGTATCATGA TGGTCGGTTT ATTGGCAGCT 
TGTGGTGGCG AAACTACCCC AACCACTGCT CCAACCACTG CAGCAGAACC AACCGCCGCC
CCAACCACTG CGGCAGAAGC AACCGCTGTT CCAGAAGCAA CCGCCGCTGC AACTACAGAA
GCAACCGCTG CTCCCGAAGC AACTACCGCC CCTGCAACTG GTGGCGACAC GATGGCAGCA
ACTGGCGATA TCACTTTGTG GCACGCTTAC AGCACCGGTG GCGCTGAAGA TGCAACCTTA
ACCGAGTTGA TTGAAAAAGC CAAAGCTGCT TTCCCAGATG CTAACATCAG CGTCTTGCAA
GTACCATTCG ACCAAGTATT CAGCAAGTTT GAAAATGACG TTGCTGCTGC TGGCGGCCCA
GACTTGCTCT TGGCTCCAAA CGATAGCTTG GGCGATTTGG CTCGCAAGAA CTTGTTGGCC
GACCTTGATG CTTACAAAGC TAACTTGACC AACATCGCTC CTGCTGGCGT TGCTGGGATG
TCGGTTGATG GCAAGCTCTA TGGTATTCCA GAATCATTCA AAGCGGTTGC TTTGTACTAC
AACAAATCAA CGATTGCCAC CCCACCATCA ACCACCGACG AGTTGTTGCA ATTGGTCAAA
GATGGCAAGA AATTGGTCTT GAACCAAAGC GCTTACCACA ACTTTGGCTT CTTCCAAGCT
TTCGGTGCTA GCTTGTTCAC CGCTGACAAG GCTTGTGGCT TGGTCAATGG TGGCGGCGAT
GCCTTGAAGT ATTTGCAAGA TCTCAAAGCT GCTGGCGCAA CCTTCTCAAC CGATGGTGCT
CAAGCTGATG CGCTCTTCCG CGAAGGCCAA GCTGACATGA TCATCAACGG GCCATGGGTT
TTGGCCGACT ACCAAGCTGC TTTGAGCGAC AAACTTGGTG TTGCCGCAAT GCCTGCTGGT
CCTAAGGGTC CTGCTGGTCC ATTGACTGGC GTTGACGGTT TCTATGTCAA CATCAACAGC
CAAAATGTTG AAGGTGCAGT TGCTTTGGCA ATGTACTTGA CCAACACCGA ATCACAAAAA
ATCTACACCG AAAAAGCTGG CCACGTTCCA GCTGATGTCA ACGTTGTACC AACCGATGCG
TTGGTGCTCG GCTTCAGCCA AGCTGCTTCA ACTGGCTATG CTCGCCCACA AGACCAAGAA
CTTAACAACT TCTGGACTCC AGTTGGCGAT GCTGTAACCA AAGCACTTGA TGGCGGCGAA
GATGCAACCA AGGCGATCAC TGATGCCTGT GCTGCAATGG ATACCGCTAA CGGTAAATAA
 
Protein sequence
MKRILSFLLV SIMMVGLLAA CGGETTPTTA PTTAAEPTAA PTTAAEATAV PEATAAATTE 
ATAAPEATTA PATGGDTMAA TGDITLWHAY STGGAEDATL TELIEKAKAA FPDANISVLQ
VPFDQVFSKF ENDVAAAGGP DLLLAPNDSL GDLARKNLLA DLDAYKANLT NIAPAGVAGM
SVDGKLYGIP ESFKAVALYY NKSTIATPPS TTDELLQLVK DGKKLVLNQS AYHNFGFFQA
FGASLFTADK ACGLVNGGGD ALKYLQDLKA AGATFSTDGA QADALFREGQ ADMIINGPWV
LADYQAALSD KLGVAAMPAG PKGPAGPLTG VDGFYVNINS QNVEGAVALA MYLTNTESQK
IYTEKAGHVP ADVNVVPTDA LVLGFSQAAS TGYARPQDQE LNNFWTPVGD AVTKALDGGE
DATKAITDAC AAMDTANGK