Gene Information |
Locus tag | Haur_0261 |
Symbol | |
ID | 5732156 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 306391 |
End bp | 307551 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277385 |
Product | ABC transporter periplasmic-binding protein |
Protein accession | YP_001543041 |
Protein GI | 159896794 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4143] ABC-type thiamine transport system, periplasmic component |
TIGRFAM ID | [TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily |
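The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no residue; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 306391
END_BP = 307551
GENE_LENGTH_BP = 1161
PROTEIN_LENGTH_AA = 386


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 1161
    print(protein_length(GENE_LENGTH_BP))  # 386
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.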
|
Plasmid Coverage Information |
Number of covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000633057 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage Information |
Number of covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAACGTT GGTTACTGAG TTTTTTAACA ATTGGTTCGT TGGTTTTGAC TGGCTGTGGT GCGGCGGCTA CGCCAACCGC TGAAACCGCG ATTCAGCCTA CTCCGGCCAC CACGGCTGCC GCTGATGTTA CGCCTGCAGC AACCCAAGCC ACCCAAGCGC CGACCGATAG CGGCCAGTTT GCCGGCCAAA CCTTGGTGGT GGTTTCGCAC GATAGTTTTG CAATCAGCAG CGAAGTTATC TCAGGCTTTG AGCAAATGAC TGGCGCGACC GTCCAAATTC TAGAATCGGG CGATGCTGGC GAGGCGCTCA ATAAGAGCAT TTTGGCCAAA GGCGCACCCT TGGGCGATGT GTTGTATGGG GTCGATAATA CCTTTTTGAG TCGGGCGCTT GATGCTGATA TTTTCGAGGC CTACGCTGCC AAGAATCTCG AGCAAATTCC TGCTGAATTG AAGCTCGATA GCCAAAATCG GGTTTCGCCC GTTGATGTTG GCTATGTCAC GATCAATTAC GATAAAGCTG CCTTGGCCGA GGCTGGCCTG AGCTTGCCAA CCGATTTGCG CGATTTAACC AAGCCTGAGT GGAAAGGCAA GTTGGTGGTT GAAAATCCGG CAACGTCATC GCCTGGCTTA TCATTTATGT TGGCGACGAT TGCCCACTTT GGCGAGAGCA GCGATTATAC CTGGCGCAAC TTCTGGAGCG ACCTGCGAGC CAATGAGATC AAAGTGGCTA GCGGTTGGGA AGAAGCCTAC TATGGCGATT TTAGCGGAGC CAGTGATGGT CAATATCCGT TGGTCGTCAG CTATGCGACC AGCCCTGCTG CCGAAGTTAT CTTTGCGACA ACGCCCTTGA CCGATGCACC GACTGGCAAT TTGTTGTTAC CTGGTGGCGC ATTCCAGCAA ATTGAATTTG TGGGCTTGCT AAAAGATGCC AAAAACCCTG AATTGGGCAA AGCTTGGATT GACTATATGC TGAGCGACAC CTTCCAGAGC GATATTGGCG GACAGATGTT TGTGTACCCA GCCCTGCCTA GCGCTAAAGT GCCAGCTGAA TTTACTCAAT ATGCCCAAGT GCCTAGCAAT GTAACGACGC TTGCGCCAGC CGAGATTGCT GCCAACCGCG AGCGCTGGCT CAACGAATGG CAAGAAACCG TCCTACGCTA A
|
Protein sequence | MKRWLLSFLT IGSLVLTGCG AAATPTAETA IQPTPATTAA ADVTPAATQA TQAPTDSGQF AGQTLVVVSH DSFAISSEVI SGFEQMTGAT VQILESGDAG EALNKSILAK GAPLGDVLYG VDNTFLSRAL DADIFEAYAA KNLEQIPAEL KLDSQNRVSP VDVGYVTINY DKAALAEAGL SLPTDLRDLT KPEWKGKLVV ENPATSSPGL SFMLATIAHF GESSDYTWRN FWSDLRANEI KVASGWEEAY YGDFSGASDG QYPLVVSYAT SPAAEVIFAT TPLTDAPTGN LLLPGGAFQQ IEFVGLLKDA KNPELGKAWI DYMLSDTFQS DIGGQMFVYP ALPSAKVPAE FTQYAQVPSN VTTLAPAEIA ANRERWLNEW QETVLR
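The gene sequence starts with GTG while the protein starts with M, which is consistent with translation table 11, where GTG is one of the permitted initiation codons. The following is a minimal sketch of how the space-grouped sequence could be cleaned, its GC fraction computed, and the reading frame translated. It does not reproduce whatever pipeline generated this record; the helper names (`clean`, `gc_fraction`, `translate_table11`) are hypothetical, and the codon table is written out by hand from the NCBI table 11 definition.

```python
BASES = "TCAG"
# Amino acids of NCBI translation table 11, listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))
# Initiation codons allowed by table 11; each is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate_table11(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still encode Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    prefix = "GTGAAACGTT GGTTACTGAG TTTTTTAACA"
    print(translate_table11(prefix))  # MKRWLLSFLT
```

Passing the full gene sequence to `translate_table11` and `gc_fraction` should allow a comparison against the Protein sequence and GC content fields, assuming the record is internally consistent.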