Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2657 |
Symbol | |
ID | 5734552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3410085 |
End bp | 3411938 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279799 |
Product | extracellular solute-binding protein |
Protein accession | YP_001545423 |
Protein GI | 159899176 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000297428 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCGAAC GAAGCAAACG CTTAGCCGCC ATCGTGCTTA CTGGCGCGAT TTTGGCAGCA TGCGGTAGCG GCACCAGCAC CACACAACCA ACCACTGGCA GCAGCGAGGC AACCGCCGTC GTTGAGCCAG GTACTGATAC TGGGTCGCAA CCCAGCAGCG ATGGCATTGT AATCATTGGG ATGAGCCAAT CGCCAGACAC CTTATTTGGG ATGGAATCGC AATCGAGCGC CACCACCCAA GTGTTAAACT CAGTGCAACC AGCCTGTGTA ACGACCCTCA GCTACGATTA TCAACCAGTC TGTTTCGCTG AATTGCCCAC CTTTGAAAAT GGCGGCGCAG TCGAAGAAAT GGTAACAGTT GATCAAAGCT ATACCGGCCC ATTCGTGATC GAGAATGAAC TGATCACTGA TACCAGCGTT TTGACTGGGC CAATCGAATT GCCCCAAGTT AAAGTGACAT GGAAGTTGAT CGATGGCATT ACTTGGGAAG ATGGCACACC AGTAACCGCC GATGACTTTT TGTTTGCTGC TGAATTGTAT GCTAACCCAG GCACCAAAGT CGCCAGCCGC TTCACCATCG AGCACACCGC CAAATACGAA AAAGTTGATG ATCAAACTTT CTCATGGTAT GGCGTGCCAG GCTATCAAGA TTCAACCTAT TTCTTGAACT ACGCAGGTGG CGCACTTGGC CCTGAACCCA AGCATGTGCT CGGCAGCGTC GATGCAGCAA CCATCGGCGG TAGCGATTAT TCCAGCAAAC CGTTAGCTTA TGGCCCGTAC AAAATTGACG AATATGTGCC GCAAGAACGG GTGACAATGA GTGCTAATCC ACATTATTGG GGAGCAAAGC AAGGCCTACC CAAAATTAGC AATGTAATCT ATAAATTCGT CACCAGCGAA GATCAAATTT TGCAACAATT GCAATCAGGC GAAATCGATG TCGTTGGCCA AATTGGCTTA TCGCTAGCTC AAGCGCCATC ACTTGATGAA TTGCAAGCAG GCGCTGAATT CGATATTCAA TATGTGCCAG CCACGGTTTG GGAACATATT GACTTTGCTG TTGAGCGCGG CGATGGCGTT GCAACCCCAT TTGCTGATGC CAAAGTGCGT CAAGCAGTCG CCTATGCAAT CAACCGCCAA GAAATCATCG ATCAGATCTT GTTTGGTAAA ACCGTCGCAA TGAACAGCTT CATGCCCGAT GATCACTGGG CCTATCCCAG CGATGCCAGC GTGATCAACT CTTATGCCTT TGATCCCAGC AAAGCGATTC AATTGCTGAA TGAAGCTGGT TGGGTTGCTG GCGATGATGG CATTTTGGTC AAAGATGGCG AACCATTCAA GGTTGAATTC TTCACCACCG AAGGCAACGA TACCCGTCAA GCAGTCGCCC AATTGGTGCA AGAATACTTG CGCGATGTCG GGATCGACGT TGAATTAAAG TTTGTCGCTG GCACTGATGT GCTGTTCAAA AATGGCTCGG AAGGTATTTT GGCAGGTCGC CGCTACGATA TGGCATTGTA TGCTTGGGTC AGTGGCCCTG AGCCATCAAC GCCGTTGTAT CTGTGCAGCC AAGTGCCAAC AGCAGCCAAT GGCTACGCTG GCCAAAACAA CACTGGCTAC TGCAACCCCG ATTACGATAA AGTAGCGCTC GAAAGCCAAA GCATCATCGA ACGGGCTGGC CGCTTGCCAT TGCTTAGCCA AGCCCAACAA ATCTTCAACC GCGATTTGCC AACCTTGCCG TTATATCAAC GGATCAACGT TGGGGCTGCC CGCAAAACAA TCAGCGGCTT CAAGCTCGAT CCAACCAGCC AACAAGATTT CTACAACATC GAAACCTGGG AATTAGCCCA ATAA
|
Protein sequence | MFERSKRLAA IVLTGAILAA CGSGTSTTQP TTGSSEATAV VEPGTDTGSQ PSSDGIVIIG MSQSPDTLFG MESQSSATTQ VLNSVQPACV TTLSYDYQPV CFAELPTFEN GGAVEEMVTV DQSYTGPFVI ENELITDTSV LTGPIELPQV KVTWKLIDGI TWEDGTPVTA DDFLFAAELY ANPGTKVASR FTIEHTAKYE KVDDQTFSWY GVPGYQDSTY FLNYAGGALG PEPKHVLGSV DAATIGGSDY SSKPLAYGPY KIDEYVPQER VTMSANPHYW GAKQGLPKIS NVIYKFVTSE DQILQQLQSG EIDVVGQIGL SLAQAPSLDE LQAGAEFDIQ YVPATVWEHI DFAVERGDGV ATPFADAKVR QAVAYAINRQ EIIDQILFGK TVAMNSFMPD DHWAYPSDAS VINSYAFDPS KAIQLLNEAG WVAGDDGILV KDGEPFKVEF FTTEGNDTRQ AVAQLVQEYL RDVGIDVELK FVAGTDVLFK NGSEGILAGR RYDMALYAWV SGPEPSTPLY LCSQVPTAAN GYAGQNNTGY CNPDYDKVAL ESQSIIERAG RLPLLSQAQQ IFNRDLPTLP LYQRINVGAA RKTISGFKLD PTSQQDFYNI ETWELAQ
|
| |