Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3496 |
Symbol | |
ID | 5735357 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4402834 |
End bp | 4404606 |
Gene Length | 1773 bp |
Protein Length | 590 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280643 |
Product | extracellular solute-binding protein |
Protein accession | YP_001546260 |
Protein GI | 159900013 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGACC TTGGATTGAT GCTCATGAAA TTGCGTTTTA GCCTTGCGCT GTTGCTCTGT TTTAGCTTGA TTGGTTGTAG TATTGAGGGT GGTGCACCAG CTGGTCAGCC CACTGTCCAA GTGCCAACTC CCTTGCCAAC GACCAACCCT GCCACTTCCA ATCTTTGGAC GTTGGGCTTA ACGGAGGAGC CAGTTGATCT GTATCCCTAC AGTTATGCCT TTCGTGGCGC AGCTCCATTG ATTGAATTGC TTTACCCTGC GCCTTTGACG GTGGTTGCTG AAGCCTATAC CACAACTGGC GTTTTGGAGC GCGTGCCATC ATTCGAAAAT GGTGATGTGC AATATATTAC AACCACGGTT AATCTTGATG CGAATGGCGT AATTACCACC ACGCAAACCG AAACGATTAC CAGCGTGCGT CAGTTGACCG TCACCTATCG CTGGAATAAA AATTTGCAAT GGGAAGATGG CCAACCATTA ACCGCAGCTG ATTCAGTTTT TGCCTATAAT TTGTTGCGTG GTAGCGCCTC AACCGCCCAA TTGGCAACCC AAAGCGACTT AACCGCCGAT TATGTGGCGA TTGATCAATA TACGACTCGT GCTTATTTGC CGCCCGAACG CGACGACCCA AATTATTTGG TGACGGTCTG GACTCCATTG CCAGCGCATT TGTTTGAAGG TCAGCCAAGT GCCAAAGAAG TTAGCGATCG TTTGGCCCAG TCGCCAGTTG GCTATGGCCC GTATACGCTC AAAGCCTGGA CGGCAGGCAC GCAGTTGGAA TTTGTGCGAC GCGAAGGTCA AACGGAACAA TTGCCTAGCA CGATTATTGC CCGTTTATAT CCTGATATTG CCATGATGCG CGACGATGTG CTGAGCGGGC GGGTCGATGT GGCTTGGACA GAAGGTTCGT TGGAGCAATT AGCGCTTGAT CTGAAAACTG ATGTGCAATC CAAAACCTTG CAATTGCTCC AAGCTCCCAA CCCAATTTGG GAACATATCG ACATGAATTT GGCGGTTGTA GCTTTGCAAG ATATTCGGAT GCGCCAAGCG ATTGCCCATG GTTTTGACCG TGAGGCGATT AGCACAACCT TGTATGGCAC GCCCAAAGCA GTTTTGCATA GTTGGTTGGC GGCTGAATCA TGGGCTTTTG ATCCAACAAC AGTGGTCAGT TATACCTTTG ATCCTGCGCT TTCACGCCAA TTGCTTGATG AAATGAATTA TCGTGATACC AATGATGATG GTTTGCGCGA ACGCCCTGAT GGTACGCCCT TCCAATTGAC ATTAACCACT TCGGCTCAAA CCCCGATTCG CCAACGCCTG AGTGAGCAAT TTGTCAGCGA TATGCAGGCA ATCGGGATTG ATATTAAGGT TGAAGCCTTA TCAACCACCG ATTTGTATAG CCAGCAAGGG CCATTATTTG GGCGACGCTT TGAATTGGCC TTGTTTGGCT GGCTGCGCAG CGTTGATCCC GATGGCGCGG TGTTGTGGAG TTGTGCCGCG ATTCCTAACC AAATTAACGG CTACTCCGGC GATAATTTTA CTGGTTGGTG TATGGATACC GCCGATCGGG CGATTCGTAC CGCGACCAGT TCGCTTGATC CGGCGGTGCG TAAGGCTGCC TATAGCGAAC AACAGCAAAT TTTCACCCGC GAATTGCCAG TCTTGCCCGT GATTACCCGC CAAACCACGG TGTTGCTTGC GCCGAATGTG CGAGGTGTGC AACCGCAACC ATTGGCTCCA ATTACTTGGA ATGTGGGTGC TTGGCAACGC TAG
|
Protein sequence | MLDLGLMLMK LRFSLALLLC FSLIGCSIEG GAPAGQPTVQ VPTPLPTTNP ATSNLWTLGL TEEPVDLYPY SYAFRGAAPL IELLYPAPLT VVAEAYTTTG VLERVPSFEN GDVQYITTTV NLDANGVITT TQTETITSVR QLTVTYRWNK NLQWEDGQPL TAADSVFAYN LLRGSASTAQ LATQSDLTAD YVAIDQYTTR AYLPPERDDP NYLVTVWTPL PAHLFEGQPS AKEVSDRLAQ SPVGYGPYTL KAWTAGTQLE FVRREGQTEQ LPSTIIARLY PDIAMMRDDV LSGRVDVAWT EGSLEQLALD LKTDVQSKTL QLLQAPNPIW EHIDMNLAVV ALQDIRMRQA IAHGFDREAI STTLYGTPKA VLHSWLAAES WAFDPTTVVS YTFDPALSRQ LLDEMNYRDT NDDGLRERPD GTPFQLTLTT SAQTPIRQRL SEQFVSDMQA IGIDIKVEAL STTDLYSQQG PLFGRRFELA LFGWLRSVDP DGAVLWSCAA IPNQINGYSG DNFTGWCMDT ADRAIRTATS SLDPAVRKAA YSEQQQIFTR ELPVLPVITR QTTVLLAPNV RGVQPQPLAP ITWNVGAWQR
|
| |