Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4563 |
Symbol | |
ID | 5736408 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5840220 |
End bp | 5841539 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281725 |
Product | extracellular solute-binding protein |
Protein accession | YP_001547322 |
Protein GI | 159901075 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCGTA TTTTGAGCTT CCTGCTCGTC AGTATCATGA TGGTCGGTTT ATTGGCAGCT TGTGGTGGCG AAACTACCCC AACCACTGCT CCAACCACTG CAGCAGAACC AACCGCCGCC CCAACCACTG CGGCAGAAGC AACCGCTGTT CCAGAAGCAA CCGCCGCTGC AACTACAGAA GCAACCGCTG CTCCCGAAGC AACTACCGCC CCTGCAACTG GTGGCGACAC GATGGCAGCA ACTGGCGATA TCACTTTGTG GCACGCTTAC AGCACCGGTG GCGCTGAAGA TGCAACCTTA ACCGAGTTGA TTGAAAAAGC CAAAGCTGCT TTCCCAGATG CTAACATCAG CGTCTTGCAA GTACCATTCG ACCAAGTATT CAGCAAGTTT GAAAATGACG TTGCTGCTGC TGGCGGCCCA GACTTGCTCT TGGCTCCAAA CGATAGCTTG GGCGATTTGG CTCGCAAGAA CTTGTTGGCC GACCTTGATG CTTACAAAGC TAACTTGACC AACATCGCTC CTGCTGGCGT TGCTGGGATG TCGGTTGATG GCAAGCTCTA TGGTATTCCA GAATCATTCA AAGCGGTTGC TTTGTACTAC AACAAATCAA CGATTGCCAC CCCACCATCA ACCACCGACG AGTTGTTGCA ATTGGTCAAA GATGGCAAGA AATTGGTCTT GAACCAAAGC GCTTACCACA ACTTTGGCTT CTTCCAAGCT TTCGGTGCTA GCTTGTTCAC CGCTGACAAG GCTTGTGGCT TGGTCAATGG TGGCGGCGAT GCCTTGAAGT ATTTGCAAGA TCTCAAAGCT GCTGGCGCAA CCTTCTCAAC CGATGGTGCT CAAGCTGATG CGCTCTTCCG CGAAGGCCAA GCTGACATGA TCATCAACGG GCCATGGGTT TTGGCCGACT ACCAAGCTGC TTTGAGCGAC AAACTTGGTG TTGCCGCAAT GCCTGCTGGT CCTAAGGGTC CTGCTGGTCC ATTGACTGGC GTTGACGGTT TCTATGTCAA CATCAACAGC CAAAATGTTG AAGGTGCAGT TGCTTTGGCA ATGTACTTGA CCAACACCGA ATCACAAAAA ATCTACACCG AAAAAGCTGG CCACGTTCCA GCTGATGTCA ACGTTGTACC AACCGATGCG TTGGTGCTCG GCTTCAGCCA AGCTGCTTCA ACTGGCTATG CTCGCCCACA AGACCAAGAA CTTAACAACT TCTGGACTCC AGTTGGCGAT GCTGTAACCA AAGCACTTGA TGGCGGCGAA GATGCAACCA AGGCGATCAC TGATGCCTGT GCTGCAATGG ATACCGCTAA CGGTAAATAA
|
Protein sequence | MKRILSFLLV SIMMVGLLAA CGGETTPTTA PTTAAEPTAA PTTAAEATAV PEATAAATTE ATAAPEATTA PATGGDTMAA TGDITLWHAY STGGAEDATL TELIEKAKAA FPDANISVLQ VPFDQVFSKF ENDVAAAGGP DLLLAPNDSL GDLARKNLLA DLDAYKANLT NIAPAGVAGM SVDGKLYGIP ESFKAVALYY NKSTIATPPS TTDELLQLVK DGKKLVLNQS AYHNFGFFQA FGASLFTADK ACGLVNGGGD ALKYLQDLKA AGATFSTDGA QADALFREGQ ADMIINGPWV LADYQAALSD KLGVAAMPAG PKGPAGPLTG VDGFYVNINS QNVEGAVALA MYLTNTESQK IYTEKAGHVP ADVNVVPTDA LVLGFSQAAS TGYARPQDQE LNNFWTPVGD AVTKALDGGE DATKAITDAC AAMDTANGK
|
| |