Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1897 |
Symbol | |
ID | 5733786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2288693 |
End bp | 2290765 |
Gene Length | 2073 bp |
Protein Length | 690 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641279041 |
Product | extracellular solute-binding protein |
Protein accession | YP_001544668 |
Protein GI | 159898421 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000383854 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAAAGA AGCCAAAAAC CCTGTGGAGC TTGATTGCGC TCCTGATTCT CGTCTTGCCA CTCTTAGCAG CCTGTGGCGA TGCCGCCTCA ACTGCAACCA CCGCGACCGA CTCAACCCAA CCAACTACTG CTAGCGATTC TTCAGCAACA ACTACCGCTC CAACCGCCGG CACTGATTCC CAGCCGACAA CTGGCAGTAC AGATGTTAAA GATGTGACGG TAACCTTACC ATTCCAACAA GGTGGCATTA CCAGTCTTGA CCATGCCTAC TGGTCAGCTC AATTGTTTGT CTCACAAGGC GTTGTGTTCG AAGGCTTATA TGGCTACGAT CAACAATTGA ACATCGTGCC AAAAATTGCC GAAAGCGCTA CTCCTTCAGC CGACAACCTC GTTTGGACGA TCAAACTGCG TAAAGACAAA AAATGGTCGA ATGGCGACCC CGTTACCGCT AAAGATTTCT ATGCTGCTTG GGTTCGAGCA GCTGGCCCAG AATTAAAAGA TGCTCCAATG TGGGCAGGCA TGATGGGCTA TGTCAAAAAT GGTTATGCCT TCAAGGCTGG CGCAGTTGGT GTCGATGAAG TCGGCTTCAA ATTGATCGAC GATTACACGA TCGAAGTGAC CTTGAGCCAA CCAAACGCTG CATTTATCAA CTTCTTGCCA GTTATGAACG CCATGCCAAT CAATGCCAAG AGCTTGGAAG CCAACCCTAG CGATTGGTTT GACCCCAAGA ATGCGGTCTA TAACGGCCCA TACATTGTTG AATCGTGGGT CAGCGGCGGC GATATTGTGC TCAAGCGCAA CCCCAATTAT GTTGGCGAAG GCACTGGCAA CGTTGGCACG ATCGTGTTGC GGCCTTACGC CGATGCTAAT GCTCGCTTGC AAGCCTTTGA AAACCAAGAA ATTCAATTCA CCTTCCTTGA AGATGCTAGC CAATTGGCCT ACGCTCAAGG CAACCCCGAC ATCAAAGACA ACATCAAGAG CGAAGAAAAC CCAATCGTTT GGAAGGGTAT TCAATACAAT CGTTCATTGA ATGCTGGCCC ATTGCAAGAT GTGCGGGTGC GCCAAGCCTT CGCGTTGTCA ATCGACAAAA AAGCTATCAC CGACCAAATT TTGAAGGGCT TGGCAGTCCC AACCGATACC TTCAACAGCG ACGTACGGGT CAAGGATAAA GTTAAGAGCT TAAGCTATGA TGTCGCTAAA GCCAAGCAAT TGCTGGCCGA AGCTGGCTAT CCCGATGGTG CTGGCTTCCC TGAATTGACC TTCTACGCGC CACCATCAAA CGACCCACAA ATGCCTTGGA TTGAAGCCGT CGCCAAGATG TGGCAAGATA ATTTGGGCGT GAAGGTTGTG ATTCAAAACA ACGAAGGTCC AGTTTACTCA AACATCCAAT GGTCGAACTA TAACAAAGAT ATTCAACCAG GCTTTGCGAC CCTTGGCGGC CCAATGAACT GGTTCCAACC ACTCGACTTG ACCTTGGCTA CCAACCATAT TTGGTACTTC ATGGACTTCA AAGAAGGTGG GATGGCTAAG TTCGCTGAAT ACCAAACCCA AATCGATGCT GTCAGCAGCA CCGAAGCCGT TGGCGATTGG GCTGATTTGA CCGCTCGTGC TGAAGCTGCT TGGACTAAGC GCCAAGCTAT CAGTAAAATT GAAGAGCCAG AAATGGCTAA AATTTCAGCT AAAGCTCCTG AAGCTCCCAG CTTCAAAGAT CAATGGGATG CAATCAACGA ACGCTTTACT GCTGCTAAAG ATGATGCTGA AAAGCTGACT GCCTACAAAG ATGGCTTGTT GTTGGTCTTG AAGGAAGAAC AAGACGCAAC GTGGTATGAT TATCGTACCG AAGAAAACAA ACAAGCCCAA CGCTTGATGA TCAAACTGGC TGGCCAAACG CTCGATGACG CTTGGCAAAC CTTGCCCGAA ATTCAACAAT TGGCAATCGA CTCAGCTTGG ATGATCCCCA TCTACTACGA CAAGTTGTAC TACGCAACCG ATCCACGACT CTCAGGCATT GTGATCAACA AACTCTCGTG GGGCGGGATC TTCCAATATC AATATTTACA GTGGACTGAA TAA
|
Protein sequence | MVKKPKTLWS LIALLILVLP LLAACGDAAS TATTATDSTQ PTTASDSSAT TTAPTAGTDS QPTTGSTDVK DVTVTLPFQQ GGITSLDHAY WSAQLFVSQG VVFEGLYGYD QQLNIVPKIA ESATPSADNL VWTIKLRKDK KWSNGDPVTA KDFYAAWVRA AGPELKDAPM WAGMMGYVKN GYAFKAGAVG VDEVGFKLID DYTIEVTLSQ PNAAFINFLP VMNAMPINAK SLEANPSDWF DPKNAVYNGP YIVESWVSGG DIVLKRNPNY VGEGTGNVGT IVLRPYADAN ARLQAFENQE IQFTFLEDAS QLAYAQGNPD IKDNIKSEEN PIVWKGIQYN RSLNAGPLQD VRVRQAFALS IDKKAITDQI LKGLAVPTDT FNSDVRVKDK VKSLSYDVAK AKQLLAEAGY PDGAGFPELT FYAPPSNDPQ MPWIEAVAKM WQDNLGVKVV IQNNEGPVYS NIQWSNYNKD IQPGFATLGG PMNWFQPLDL TLATNHIWYF MDFKEGGMAK FAEYQTQIDA VSSTEAVGDW ADLTARAEAA WTKRQAISKI EEPEMAKISA KAPEAPSFKD QWDAINERFT AAKDDAEKLT AYKDGLLLVL KEEQDATWYD YRTEENKQAQ RLMIKLAGQT LDDAWQTLPE IQQLAIDSAW MIPIYYDKLY YATDPRLSGI VINKLSWGGI FQYQYLQWTE
|
| |