Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4463 |
Symbol | |
ID | 5736314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5707502 |
End bp | 5708737 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281626 |
Product | extracellular solute-binding protein |
Protein accession | YP_001547223 |
Protein GI | 159900976 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00452134 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGCAG TATGGCGGTA TTTTAGCCTT TTTATGATTG GGATTCTCAT TGTTGCTGGT TGTAGCACGA CAAGCTATGA TAACACTTTG TTAACCCCAG GGGCTACCGC TAATCCTCCC GCACGCACGC AACTGGTCAT TTGGCATGGA GCCGATGCCC AACGGAGTGA ACTTTTCACC CGATTGCTGT TAGAATATCA ACGTGAGCAT CCAAACGTTG TCATTCAAAT CGTCAATCGT GGGGCAAACC TGCTACACGA TTATCGTGCG GCCCTGCTTG AAGGCACACC ACCAGACCTG ATCTGGCTCA ACGAAAATCG TTGGGTAGGG GAACTGGCAG ATCAACAGCT TATCATCGAT CTGACAGAAC GACTAAGCGA CGAGAACCTT GAGTCAATTG CGCCAGCTGC GCTTGATGGT GCCCGCTATG GCGAGAAATT ATATGGCTTG CCATTGACGC TAGATCTACC TGTGCTGTAC TATAATCGTG CAAACTTTGT GAGCACACCG CCGCAAAGCA CTGCTGAATG GCTTGAGATC GCTCGCGGGT TTAGCGATGA TCAAGGACAG TACGGATTAG CGTATAATTT ATCGCTATAC TTTACCCAAC CCTACCTCCC AGCCTTCGGA GGCGCAATCT TCGATACTAC TGGCGAGGTC GTGCTGGGAA CCCAAAGCTA TACCCCAACA TTACAGTGGT TGACGTGGGT TGACGAATTA GCCCAAGACC CACGCTTGTT AGCCCGTGAT GATCATCGAC TGGTAGCTCG CAGCGTGAGC CAAAACAGTG CGATCATGAC GATTGACTGG GCAGATCAAA TCGGAACCTA TCGTCAATTG TGGGGGGAGA ACGTTGGCGT GCAACCATTG CCACGCCTGA GCCAAACCGG CCAAGAACCG CAACCCTTTG TTCGGAGCAG CGTGCTTGTG ATCAGCCCAC GCAGCACTGA ACAACAGCAA AACGCCGCGC TTGACGTGAT GCGGTTTCTT GTGGAGATGA AAGCGCAAAC GGCTTTTCAA GCCGCTGATA TACCAAGCGT CCGAATCGAC TTAGCCAGTG TCGATCCACT TTACACTCAA ATTCAACTGG CGGTTAGTCG AGCCAGCGCT TGGCCCACCA CCCTTCGTTT CAACAACGGA TGGGATATCC TGATCGCTTT GGTGCGTAAT AGCTTAAACG GCGCACCCTT GGAAGAAAGT ATCGCGAATG CCGATCGCCT GCTACGGAGC GAGTAG
|
Protein sequence | MRAVWRYFSL FMIGILIVAG CSTTSYDNTL LTPGATANPP ARTQLVIWHG ADAQRSELFT RLLLEYQREH PNVVIQIVNR GANLLHDYRA ALLEGTPPDL IWLNENRWVG ELADQQLIID LTERLSDENL ESIAPAALDG ARYGEKLYGL PLTLDLPVLY YNRANFVSTP PQSTAEWLEI ARGFSDDQGQ YGLAYNLSLY FTQPYLPAFG GAIFDTTGEV VLGTQSYTPT LQWLTWVDEL AQDPRLLARD DHRLVARSVS QNSAIMTIDW ADQIGTYRQL WGENVGVQPL PRLSQTGQEP QPFVRSSVLV ISPRSTEQQQ NAALDVMRFL VEMKAQTAFQ AADIPSVRID LASVDPLYTQ IQLAVSRASA WPTTLRFNNG WDILIALVRN SLNGAPLEES IANADRLLRS E
|
| |