Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1605 |
Symbol | |
ID | 5733507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1862688 |
End bp | 1864475 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278744 |
Product | extracellular solute-binding protein |
Protein accession | YP_001544376 |
Protein GI | 159898129 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTTCC GTAAACCAGG TAAGCGTTCG ATTGGCTGGC TATTGCTCTT GGTATTGATG ATACCGTTGA TAGCTGCCTG TGGCGAAACC GCCGCTCCAA CCGCAACTGT TGGCTCAACT CCCGCTACTG GTGGCGAACC CACTGCTGCC CCAACTACCG CTCCTACAGA TGACACCGCT GCAGCCACAC CAACCGACGC CGCTGCTGCA ACCGAAGAAC CAACCATGGG CGATGCTGAT AAGTATCTGG TATTTGGTGG TTCGGGCGAA CCCGATTCAC TCGATTCGAT GGATACGACC ACGGGTACTG CCTTGATTGT AACCCGCCAA ATCCAAGAAT CGTTGTTGGG TTTCAAGGCT GGTACGTTGG AAGTTGTGCC CGAATTGGCG ACCAAGTGGG AGCCAAACGC CGATGCAACC GAGTGGACGT TTACCCTGCG CGAAGGGGTT AAATTCTCTG ATGGCACCGA CTTCAACGCT GATGCAGTAG TTTTCAACTT CCAGCGCTTG TTTGTGCCTG ATTTTGAGTT TGGCTTCCGT GCCGAAGGCA AGCAATACAA CATCGTGCCC GATATTTTTG GTGGCTATGC TGGCGACCCC AACAGTGCCT TCAAAGAAAT TATGGCAGTT GATCCAACCA CGGTTAAGTT TGTTTTGACG CGGCCTGTGC CGTTGTTGCC AAGCTATTTG GCCGCTTCCT ACTTCGGAAT TTCATCGCCC GAAGCAGTCA AGGCTGCCAA AGAAAAATAT GGCTCACCAG AAGTTGGTGG CGTTGGGACT GGCCCCTTCA AGTTTGAGCG CTGGGATGCT GGTCAAAGCA TCACCTTGGT GCGCAACGAA GATTATTGGG GCGACAAGGC CAAAATGCCA GGTGTGGTTG TGCGCTTTAT CGCCGAAGCA CCCCAACGTT TGGCCGAGCT TGAAGCTGGC ACAATTGATT TCACAATCAA CTTGAGCGCT GATAGCCGCG ATAAAATTGC TTCAAGCGCC GATTTGCAAG TGGTTGATTT GACTCCATTC AACATTGCCT ACTTGTCGTT GAACATGAAC AACAAGCCTT TTGACGATGT GCGGGTTCGT CAGGCGGTTG CCTATGCCAT CAACAAGCAA GAAATTCTTG ATGCCTCGTA TGGCGGCGTT GGCTCAATTG CCGACGACTT CTTGCCCGAT GGATTGGCTG AATATCGGGC GACTGACCTC GAACCATATG CTTATGATCC AGAAAAAGCC AAAGCCTTGT TGGCCGAAGC TGGCTATGCC GATGGCTTTA GCACCATGGT CTTGACCGAT GGAACTGAAT TGCCCTTGGA ATTGTGGTAT ATGCCGGTTT CACGGCCTTA CTACCCCGAT GCTAAGTCAG TGGCTGAACT CTACGCCGCC CAACTTTCCG ACGTTGGGAT CAAGGTTGAA CTCAAGACCG AAGATTGGGG CGTGTATCTC GATAACTGGG ATGCTGGCCT GAAAAACGGG ATGGTGATGT TGGGTTGGAC GGGCGACTAT GGCGACCCTA ACAACTTCTT GTTCACTCAC TTTGGCCCAG GCAACGCCGA CGAAGCTGGT TATACTAACG AGAAAGTTTG GCAATTGCTG GCCGATGCTG GTGGCGCTTC CTCGCCCGCC GAGTCAATTC GGCTGTTCCA AGAAGCTGGC AAGTTGATTA ACACCGATTT GCCACGGATT CCGATCGTGC ATGCTCCACC CGTATTGGCT GCTAAAAAAG CCTTGCAAGG CTGGGTGCCA AATCCAACTG GTGGCGAATC ATTCGCACCG ATCTCAATCA CGAAATAA
|
Protein sequence | MSFRKPGKRS IGWLLLLVLM IPLIAACGET AAPTATVGST PATGGEPTAA PTTAPTDDTA AATPTDAAAA TEEPTMGDAD KYLVFGGSGE PDSLDSMDTT TGTALIVTRQ IQESLLGFKA GTLEVVPELA TKWEPNADAT EWTFTLREGV KFSDGTDFNA DAVVFNFQRL FVPDFEFGFR AEGKQYNIVP DIFGGYAGDP NSAFKEIMAV DPTTVKFVLT RPVPLLPSYL AASYFGISSP EAVKAAKEKY GSPEVGGVGT GPFKFERWDA GQSITLVRNE DYWGDKAKMP GVVVRFIAEA PQRLAELEAG TIDFTINLSA DSRDKIASSA DLQVVDLTPF NIAYLSLNMN NKPFDDVRVR QAVAYAINKQ EILDASYGGV GSIADDFLPD GLAEYRATDL EPYAYDPEKA KALLAEAGYA DGFSTMVLTD GTELPLELWY MPVSRPYYPD AKSVAELYAA QLSDVGIKVE LKTEDWGVYL DNWDAGLKNG MVMLGWTGDY GDPNNFLFTH FGPGNADEAG YTNEKVWQLL ADAGGASSPA ESIRLFQEAG KLINTDLPRI PIVHAPPVLA AKKALQGWVP NPTGGESFAP ISITK
|
| |