Gene Haur_0261 details

Gene Information

Locus tag: Haur_0261
Symbol:
ID: 5732156
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 306391
End bp: 307551
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 53%
IMG OID: 641277385
Product: ABC transporter periplasmic-binding protein
Protein accession: YP_001543041
Protein GI: 159896794
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4143] ABC-type thiamine transport system, periplasmic component
TIGRFAM ID: [TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily
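
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the coding region is unspliced and includes its terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp):
    """Protein length for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(306391, 307551)
print(length)                              # 1161
print(expected_protein_length_aa(length))  # 386
```

Both results agree with the Gene Length and Protein Length fields of the record.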


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000633057
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAACGTT GGTTACTGAG TTTTTTAACA ATTGGTTCGT TGGTTTTGAC TGGCTGTGGT 
GCGGCGGCTA CGCCAACCGC TGAAACCGCG ATTCAGCCTA CTCCGGCCAC CACGGCTGCC
GCTGATGTTA CGCCTGCAGC AACCCAAGCC ACCCAAGCGC CGACCGATAG CGGCCAGTTT
GCCGGCCAAA CCTTGGTGGT GGTTTCGCAC GATAGTTTTG CAATCAGCAG CGAAGTTATC
TCAGGCTTTG AGCAAATGAC TGGCGCGACC GTCCAAATTC TAGAATCGGG CGATGCTGGC
GAGGCGCTCA ATAAGAGCAT TTTGGCCAAA GGCGCACCCT TGGGCGATGT GTTGTATGGG
GTCGATAATA CCTTTTTGAG TCGGGCGCTT GATGCTGATA TTTTCGAGGC CTACGCTGCC
AAGAATCTCG AGCAAATTCC TGCTGAATTG AAGCTCGATA GCCAAAATCG GGTTTCGCCC
GTTGATGTTG GCTATGTCAC GATCAATTAC GATAAAGCTG CCTTGGCCGA GGCTGGCCTG
AGCTTGCCAA CCGATTTGCG CGATTTAACC AAGCCTGAGT GGAAAGGCAA GTTGGTGGTT
GAAAATCCGG CAACGTCATC GCCTGGCTTA TCATTTATGT TGGCGACGAT TGCCCACTTT
GGCGAGAGCA GCGATTATAC CTGGCGCAAC TTCTGGAGCG ACCTGCGAGC CAATGAGATC
AAAGTGGCTA GCGGTTGGGA AGAAGCCTAC TATGGCGATT TTAGCGGAGC CAGTGATGGT
CAATATCCGT TGGTCGTCAG CTATGCGACC AGCCCTGCTG CCGAAGTTAT CTTTGCGACA
ACGCCCTTGA CCGATGCACC GACTGGCAAT TTGTTGTTAC CTGGTGGCGC ATTCCAGCAA
ATTGAATTTG TGGGCTTGCT AAAAGATGCC AAAAACCCTG AATTGGGCAA AGCTTGGATT
GACTATATGC TGAGCGACAC CTTCCAGAGC GATATTGGCG GACAGATGTT TGTGTACCCA
GCCCTGCCTA GCGCTAAAGT GCCAGCTGAA TTTACTCAAT ATGCCCAAGT GCCTAGCAAT
GTAACGACGC TTGCGCCAGC CGAGATTGCT GCCAACCGCG AGCGCTGGCT CAACGAATGG
CAAGAAACCG TCCTACGCTA A
 
Protein sequence
MKRWLLSFLT IGSLVLTGCG AAATPTAETA IQPTPATTAA ADVTPAATQA TQAPTDSGQF 
AGQTLVVVSH DSFAISSEVI SGFEQMTGAT VQILESGDAG EALNKSILAK GAPLGDVLYG
VDNTFLSRAL DADIFEAYAA KNLEQIPAEL KLDSQNRVSP VDVGYVTINY DKAALAEAGL
SLPTDLRDLT KPEWKGKLVV ENPATSSPGL SFMLATIAHF GESSDYTWRN FWSDLRANEI
KVASGWEEAY YGDFSGASDG QYPLVVSYAT SPAAEVIFAT TPLTDAPTGN LLLPGGAFQQ
IEFVGLLKDA KNPELGKAWI DYMLSDTFQS DIGGQMFVYP ALPSAKVPAE FTQYAQVPSN
VTTLAPAEIA ANRERWLNEW QETVLR
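
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The sketch below is one possible way to do that in Python. The helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text):
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def ends_with_stop(dna):
    """True if the last three bases form a stop codon."""
    return len(dna) >= 3 and dna[-3:] in STOP_CODONS


# Demonstration on the final line of the gene sequence shown above.
tail = clean_sequence("CAAGAAACCG TCCTACGCTA A")
print(len(tail))             # 21
print(ends_with_stop(tail))  # True
```

Applied to the full gene sequence, the cleaned length should match the Gene Length field (1161 bp), and `gc_fraction` gives a value to compare against the reported GC content.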