Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4018 |
Symbol | |
ID | 5735879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5125407 |
End bp | 5127068 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281168 |
Product | extracellular solute-binding protein |
Protein accession | YP_001546778 |
Protein GI | 159900531 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.112501 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCGAA TATTGACCCT AGGATTATTG GCTGTTTTGC TCACAGCATG TAGCCTCGGC TCCACACCAG CACCCACCAA CGAGCCAACC ACACCGCCAA TTCAGGTTCC GCAAGGTGGT ACGTTAACCA TTCGCACGGC CCAAGATATT GCAGCTTTGC ACCCGTGGAA ACCAACCTGC CACGAAGAAG CTCAACTTTT AGGCCTGCTC TATCGTGGTT TAACTAAGCT TGATCAAAGC CTCGCTCCGC AACCTGATGT TGCTACAAGC TGGCAAAGCG ATAGTATCGG CCAAACCCTG ACCATGACCT TGCGCAGCGA TATTCGTTGG CATGATGATA CGACCTTGAC TGCTGCTGAT GCGGCTTGGA CGATCAGCGC CATGCAAAGC ATCAGCCCAA CCACGCCATT ATTGACCGAT CTTCAGGGCT TGGTGCGCAA GGTTACTGCA CCCGATGACA CAACTTTGGT CATTTCGTTG CGCGAACCCT ATGCGCCATT GCTCTCGGCC TTGAGCATGC CAATTTTGCC CAAACATGTG TTTGAGCAAT TAAGCCCAGT TGAGCTTGAT CAGCTTAATC TTTTGACGCA GCCAATTGGT AGCGGCCCGT TTATGTTCGA GCAACGGACT GCTGGCTCAG CAATTAGCTT AATTCGCAAT AGCAACTATA TCGATGGCGT GCCCTATCTC GATCGGGTGG CCTTTGTGGT TGCCCCCGAT CCGCAAGTGG CTCGCCAAGC AGTGCGCGAT GGAGATTTAT TGGCAGCAGA ATTGCCATGG GCACAAAGCC AAGGCTTAGG GCCAACAGTT GGCATAGGTA GTTATCCTGA AAATGGTTTT TATTATTTGG CCTTTAATAT GCGTGATGGT CGCATCTTCA GCGATCAGCG AGTGCGCCAA GCGCTGGCAT TAAGCCTCGA TCTCAATACA ATTGTCGAAA CCGCTGGCCC AGCCGCTCAA GCAATTTTGA GCGATCATTT GCCTGGCACA TGGGTTGCGC CAACTGGTGA GTTGCCCAAA CGCAACTTAG ATCAAGCTCG CGAATTACTG GATCAAGCAG GCTGGGTCTT GCCCGAAGGT GCGACAATTC GCGCCTCGAA CGGGATTACG CTGTCGATGG CGCTGTTTGT GCGGGGCGAT GACCAACGCC GCATCGAGGT TGCCGAACGG ATCGCCGCTG CTACACGTCC GGTTGGCTTC AATATTGTGG TTACGCCAGC CGATTTCGAG AGCGTGATTC GCTCTAAATT GGTAACACCC TTTGATTTTG ATTTGGCTTT GATGAGTTGG GGCAATAGTC GAGTTGGTGG TTCGCCCTCG TATACCGCCT ACGATCCTGA TAATTTCTCG TTGTTTCATT CGAGCCAAAT TTACCAAGGG GTTGCCGATG GACGGCCTGG CCTGCGCAAT TATGGTGCGT TTCAAAACAC CAGTTTCGAT AATTTATCGA CGGCAGCGCG GGCACTTTAC GCAACTGAAC GCCGCCGCGA ACTCTATCAA CAAACCAACA CAATCATTCA AACCGAATAT CCGTATGTGT TTCTGTGGGC CGATCGGATT CCGGTCGCCT TAGCCAAACA GGTGCGTTCA ACTCAAGGCG AAATTCGGTT GGATACGGCC AATTGGCTGT ATGATGTTCA ACATTGGTAT CTTGAGCAAT AA
|
Protein sequence | MMRILTLGLL AVLLTACSLG STPAPTNEPT TPPIQVPQGG TLTIRTAQDI AALHPWKPTC HEEAQLLGLL YRGLTKLDQS LAPQPDVATS WQSDSIGQTL TMTLRSDIRW HDDTTLTAAD AAWTISAMQS ISPTTPLLTD LQGLVRKVTA PDDTTLVISL REPYAPLLSA LSMPILPKHV FEQLSPVELD QLNLLTQPIG SGPFMFEQRT AGSAISLIRN SNYIDGVPYL DRVAFVVAPD PQVARQAVRD GDLLAAELPW AQSQGLGPTV GIGSYPENGF YYLAFNMRDG RIFSDQRVRQ ALALSLDLNT IVETAGPAAQ AILSDHLPGT WVAPTGELPK RNLDQARELL DQAGWVLPEG ATIRASNGIT LSMALFVRGD DQRRIEVAER IAAATRPVGF NIVVTPADFE SVIRSKLVTP FDFDLALMSW GNSRVGGSPS YTAYDPDNFS LFHSSQIYQG VADGRPGLRN YGAFQNTSFD NLSTAARALY ATERRRELYQ QTNTIIQTEY PYVFLWADRI PVALAKQVRS TQGEIRLDTA NWLYDVQHWY LEQ
|
| |