Gene Information |
Locus tag | Haur_1150 |
Symbol | |
ID | 5733043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1320383 |
End bp | 1321708 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278290 |
Product | extracellular solute-binding protein |
Protein accession | YP_001543926 |
Protein GI | 159897679 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
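The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon; the record does not state either convention, so treat both as assumptions. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1320383
END_BP = 1321708
GENE_LENGTH_BP = 1326
PROTEIN_LENGTH_AA = 441


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in an unspliced CDS.

    Assumes the length is a multiple of 3 and that exactly one
    terminal stop codon is counted in the gene length.
    """
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = coding_codons(computed_length)

print("gene length matches record:", computed_length == GENE_LENGTH_BP)
print("protein length matches record:", computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length and protein length are both consistent with the listed coordinates.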
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.344094 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACGAC CATGGATTCG CTCGTTTACC TTGCTGATCG GCTTAATTTT GGCGGCATGT GGCGAGGCGA CTACGCCAAC CACGCCCCCG ACCAACCCAA CTACGGCAAC TGGCGCTAGC AGTGCGGCTA GTGGCACGGT CACGTTGTGG TTTCACTCCG GTCAAGGTGC TGAACGCGAT GCCTTAAACG CAACGCTCCA AGCATTTGCA GCCAAAAACT CAGCAATCAA AGTTGAAGCC ATTGAATTGC CTGAAGGCGC ATATAACGAT CAAGTCAATG CTGCTGCCTT GGCTGGCGAA TTGCCCTGCT TGCTCGATTT TGATGGCCCA TTTGTCTATA ACTATGCATG GTCGGGCTAT TTGCAACCAT TGGATAGTTT GATCGCTGCC GATGTCAAAG CCGATTTTCT GCCTTCAATC ATTGAACAAG GCACTTACAA CGGCAAATTG TATAGCCTTG GGCAGTTCGA TTCGGGCTTA GGCTTCTATG CCAACAAGGA ATTGTTGGAA AAAGCTGGGG TGCGCATTCC AACTTTAGCC CAGCCATGGA CTCGCGCTGA GCTTGATGAG GCCTTGAGCG AACTTAAAGC CAATGGTTTG GAATATCCAC TTGACTTGAA AATGGACTAT GGCCGTGGCG AGTGGTTTAG TTATGGCTTT TCACCCTTCT TGCAATCTTT TGGCGGCGAT TTGATCGATC GCTCAACGTA TCAAAAAGCC AGCGGCAGCT TGAATAGCGC GGCTTCGGTC GAAGCAATGA AGTGGTTCCA AGGCCTCTTC ACCAATGGCT ATGTTAATCC TAAGCCTGCT GGCAGCACCG ATTTTGCTGA GGGTAAAGCG GCTTTGAGTT GGGTTGGGCA CTGGGCCTAC CCTGATTATG CCAAAGCCTT GGGCGATAAA TTGTTGGTGC TGCCTGCCGC CGATTTGGGC AAGGGTGCGA AAACGGGCAT GGGTTCGTGG AATTGGGGCA TTACCAGCAA GTGTGCTAAT CCGGCGGCTG CCGCTGAAGT GCTTTCGTTC ATCGTCTCGC CCGAAGAAGT GCTGCGCATG AGCGATGCCA ATGGCGCTGT GCCAGCGCGT ACTTCAGCAA TTGCCAAATC CAAATTGTTT GGTGATGGCG CTCCGTTGAA CCTCTATGTG CAACAATTGA CCAATGGCGT TGCCATGCCG CGCCCAATTA CCCCAGCCTA CCCAGTCATT ACCGTCGCCT TTGCCGAAGC CGTCGATAAC ATTGTGGCTG GAGCCGATGT GCAAGCCGAG TTGGATAAAG CGGCCCAAAA GATCGATGCC GATATTGAAG ATAATCAAGG CTATCCCGTG AAGTAA
|
Protein sequence | MQRPWIRSFT LLIGLILAAC GEATTPTTPP TNPTTATGAS SAASGTVTLW FHSGQGAERD ALNATLQAFA AKNSAIKVEA IELPEGAYND QVNAAALAGE LPCLLDFDGP FVYNYAWSGY LQPLDSLIAA DVKADFLPSI IEQGTYNGKL YSLGQFDSGL GFYANKELLE KAGVRIPTLA QPWTRAELDE ALSELKANGL EYPLDLKMDY GRGEWFSYGF SPFLQSFGGD LIDRSTYQKA SGSLNSAASV EAMKWFQGLF TNGYVNPKPA GSTDFAEGKA ALSWVGHWAY PDYAKALGDK LLVLPAADLG KGAKTGMGSW NWGITSKCAN PAAAAEVLSF IVSPEEVLRM SDANGAVPAR TSAIAKSKLF GDGAPLNLYV QQLTNGVAMP RPITPAYPVI TVAFAEAVDN IVAGADVQAE LDKAAQKIDA DIEDNQGYPV K
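The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own pipeline: it uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons), and the helper names `translate` and `gc_fraction` are hypothetical. Only the first 30 bases of the gene sequence are used here; the full sequence can be pasted in the same way, spaces included.

```python
# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in blocks of 10) and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Trailing bases that do not form a complete codon are ignored.
    """
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in the record.
gene_prefix = "ATGCAACGAC CATGGATTCG CTCGTTTACC"
protein_prefix = translate(gene_prefix)
print(protein_prefix)
```

The translated prefix agrees with the first ten residues of the listed protein sequence. Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field.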