Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2247 |
Symbol | |
ID | 5734134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2864289 |
End bp | 2865896 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279388 |
Product | extracellular solute-binding protein |
Protein accession | YP_001545015 |
Protein GI | 159898768 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0125251 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAAAC GTTTATCGAT TTTAGGTCTC TTGCTACTGG TTTTAGTTGG CTGCGATCTT GGGAGCAACC AAGCTACCCC AGCACCTGCA TCAACTCCAG TAGCCCAAAG CCAAGCTGGT GGCGGCGGCA ATTTTATTTT GACGCTTGGC GATGATCCAG CGACGATCGA CCCAGCATTG GTTGGCGATA CGACCAGCGG TTTTATTGCC CGCTTGATGT TTAGCGGTTT GGTGACGCTC AACAACGAGC TTGAAGCAGT CCCCGATTTA GCTGAAACGA TCGATGTTTC GGCTGATGGC ACGGTATATA CCTTTAAACT GCGCTCGAAT GCACGCTTTG CCGATGGTAC GCCGATTACT GCTGAAGATG TGCGTTGGAG TCTCGAACGG GCCACCGACC CTAGCCTTGG CTCAATTGTT TCGCCAACCT ACCTCGATGA TGTTGCTGGA GTGCTTGAAA AAGTGACTGG GCAAGCCAAC TCGCTCAGTG GGGTCAAGGT GGTTGATGAT CAAACAATCG CGATTACGCT GCGCCAGCCA AGCTCGCTAT TTTTGCTCAA ATTGACTCAC CCGCCAGCCT TTGTGCTCGA TCGGCGCACA GTTGAGGACA ATAGCGATTG GCTCGAAAAA CCCAATGGCT CAGGCCCGTT TATGCTCGAT CTATGGAATC ATCGCCGCCG CATGGAGCTT GTGCCCAACC CATACTACTA TGGCACTGCC CCCAAATTGG ATCGGATTAC CTATCTGATT GGGGCGGAAG GTAGTAATCC GCTGGGGTTG TATGAGCAGG GCGAAATCGA CGTGACGGGC ATTGGCAGCT ATGATCTTGA TCGGATTAAT GATGAGGCTG ATCCGTTGCA CGCTGAGCTG CGGATCACGC CACAGTTGCA ATTAAGCTAT ATTGGCTTGA ATGTGAATCA ACCGCCATTT GACGATCCGA AGGTGCGCGA AGCCTTTTAT TTGTTGATCG ATCGGGTTAA ATTGGCCGAT GTTTCGTACA ATGGCTCGGT GGTCGCAGCC CGGGGGATTT TGCCACCTGG CATGCCTGGA GCCGAGCCAG AACGTTTGCC TGAGCCAAAC GCCGATATTG CCCGTGCCAA ACAACTAATC AGCGAATCGA GCTATGGTAG CGTTGAAAAG TTTCCGCCGA TTATTGGCTA TAGCAGCGGT AGCGGGGTAG GTTTGCTGGC CCAAATTGCC AAAGATGAGC TTGGGGTAAC GATTGAAATT CGTGGTCAAG ATCAATTTGG CGATTATTTG GCAGCACTTG AGCGCGATAA TTATCATTTG TATGACCTCA GTTGGATCGC CGACTATCCC GATCCACAAA ACTTTTTAGA GGTGTTGTTT GGCAGCAAGG GTCAATACAA TCGCACCAAT TACAGCAACG CGAAGTTTGA TCAACTGATT GAGCAAGCCA AAGCCGAGGC CGATGCTGAA AAACGTGGGG CACTCTATCG CCAAGCCGAA GAGCAATTGC TCAGTGATTT TGTGGTCATT CCTTTGGTGC ATACTGTTGA TTATTCCTTG GTAAAATCGT ATGTTGATGG TTACATGATT ACGGCTTTGG GTGAGCTAGA TTTAACTGGA GTTTCGCTCA AACGCTAG
|
Protein sequence | MDKRLSILGL LLLVLVGCDL GSNQATPAPA STPVAQSQAG GGGNFILTLG DDPATIDPAL VGDTTSGFIA RLMFSGLVTL NNELEAVPDL AETIDVSADG TVYTFKLRSN ARFADGTPIT AEDVRWSLER ATDPSLGSIV SPTYLDDVAG VLEKVTGQAN SLSGVKVVDD QTIAITLRQP SSLFLLKLTH PPAFVLDRRT VEDNSDWLEK PNGSGPFMLD LWNHRRRMEL VPNPYYYGTA PKLDRITYLI GAEGSNPLGL YEQGEIDVTG IGSYDLDRIN DEADPLHAEL RITPQLQLSY IGLNVNQPPF DDPKVREAFY LLIDRVKLAD VSYNGSVVAA RGILPPGMPG AEPERLPEPN ADIARAKQLI SESSYGSVEK FPPIIGYSSG SGVGLLAQIA KDELGVTIEI RGQDQFGDYL AALERDNYHL YDLSWIADYP DPQNFLEVLF GSKGQYNRTN YSNAKFDQLI EQAKAEADAE KRGALYRQAE EQLLSDFVVI PLVHTVDYSL VKSYVDGYMI TALGELDLTG VSLKR
|
| |