Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2397 |
Symbol | |
ID | 5734278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3053669 |
End bp | 3055462 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279538 |
Product | extracellular solute-binding protein |
Protein accession | YP_001545165 |
Protein GI | 159898918 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTAC GGTTTCCTGC GTTGATGTTG CTGGTTATTA TCTTTACCAG CATGTTGGCC GCATGTGGTG ATGCGTCAAC CCTCACACCA CAAGCAACTC AAGCACCAAG TACAACCGCC CCAGCCGCTA CTGCAACAGA AGCTCCTGCC ACGGGCAGCA CCGATCCAAC CGCCGAACCA ACTGCTGCTG CAACCACAGA TACCGGTTCC ACGAGCAGTG ATGGCAAACT ATTTTTGACG GTTTCAGACC AACAACAATC AACTTGGGTG CGCAATTTCA ACCCATTTGC TAGTGATAAC CGCTGGCCAA CTGCCGCCGG GATTTACGAG CCAATGTTTA TTTACAACAT TGCCACCGGT AAAATCGAGC CATGGCTCGC CACCGAATGG GCTTGGAATA GCGATAACAC CGAGTTGACC TTCACTATCC GCGATGGCGT GAAGTGGTCA GACGGCGAAG CATTTAGCTC AAAAGACGTT GCCTACACGC TCAATTTGAT GAAAGAAAAC GAAAAGCTGC AAGGCAATGG CCGCGCTGCA ATGCGTTTTG TTGATAGCGT CGCTGCCGAT GGCAATAAAG TTGTCGTGAA GTTCAGCGAA GTTTCAACCA TCGCGCTGTA TGATATTGGC CATCAAATGA TTGTGCCAGA ACATATCTGG AGCAAGATCG CTGATCCAGT GACCTTCACC AACGAAACTC CAGTTGCAAC CGGTCCATTC ACCGAAATCA TCCGCTTCCA AGACCAAATC TGGGAACTTG GCAAGAATCC AAACTACTGG CAAGCTGGCA AACCATATAT CGATGGTATT CGCCAACCAG CCTACCCCAG CAACGATGCC GCCAACTTGG CAACGATCAA TGGCGAAAAC GACTTGGCTA GCAACTTCAT TCCTGATGTT GAAAACACCT ATGTCGCCAA AGATCCTGAA AACAACCACT ACTGGTTCCC ACCAGTCAAT GCGCCAGTGA TGTTGCTCTT GAACACCACC AAGGCTCCAT TCAACGATCC TAACGTGCGC AAAGCGATCA GCATGGGCTT CGATCGCCAA CAAATTACTG AAATTGCTAC CTACAGCTAC AACGGCCCTT CAGATGCAAC TGGTTTGCCT GAATCGTTTG CTGATTGGAA AAACCCTGAA GCCGTTGCTG CTGGCGATTG GGTCAACTTT GATGTTGAAA AAGCCAATGC AATGTTGGAT GCTGCTGGCC TGACCCGTGG CGCTGATGGG ATTCGCGTAT TGCCCGATGG CACGCCAATG ACCTACGATA TCAACGTGGT TTCTGGCTGG ACCGACTGGG TGACCACCGA CCAAATCATT GCTGAAAGCT TGAAAGAAAT CGGCATCAAC GCCACTACCC GTACCTACGA CTTTAGCGCT TGGTTCGATA AAGTTCAAAA GGGCGAGTTT GATATGTCGA TTGGCTGGAG CAACAATGCC CCAACCCCAC TCCAATTCTA TCGTGGCTTG ATGTCAGGTG AAACCGCCGA TACCCCAATT GGCGAAGCCA ACGGCGACAA CTGGCACCGC TATGGTAATC CAAAGGTCGA TGAATTGTAC TCACAATTTG TCAAAACCTC AGACCCAGCC GAACAAAAGA AAATCATGAA CGAAATTCAG ATGATCTTTG TTCAAGAAGC CCCAGCTATC CCAATTATGC CAAACATCTA TTGGGGTGAA TACAACACCA AACGCTTTAC CAACTTCCCG AACGAAGAAA ACCCCTATGT CTTGCTTTCA TCGTTCGCCC AACCCGACCG CTTGATCCTC TTGACCAACA TCAAACCAAA ATAA
|
Protein sequence | MKLRFPALML LVIIFTSMLA ACGDASTLTP QATQAPSTTA PAATATEAPA TGSTDPTAEP TAAATTDTGS TSSDGKLFLT VSDQQQSTWV RNFNPFASDN RWPTAAGIYE PMFIYNIATG KIEPWLATEW AWNSDNTELT FTIRDGVKWS DGEAFSSKDV AYTLNLMKEN EKLQGNGRAA MRFVDSVAAD GNKVVVKFSE VSTIALYDIG HQMIVPEHIW SKIADPVTFT NETPVATGPF TEIIRFQDQI WELGKNPNYW QAGKPYIDGI RQPAYPSNDA ANLATINGEN DLASNFIPDV ENTYVAKDPE NNHYWFPPVN APVMLLLNTT KAPFNDPNVR KAISMGFDRQ QITEIATYSY NGPSDATGLP ESFADWKNPE AVAAGDWVNF DVEKANAMLD AAGLTRGADG IRVLPDGTPM TYDINVVSGW TDWVTTDQII AESLKEIGIN ATTRTYDFSA WFDKVQKGEF DMSIGWSNNA PTPLQFYRGL MSGETADTPI GEANGDNWHR YGNPKVDELY SQFVKTSDPA EQKKIMNEIQ MIFVQEAPAI PIMPNIYWGE YNTKRFTNFP NEENPYVLLS SFAQPDRLIL LTNIKPK
|
| |