Gene Haur_5097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5097 
Symbol 
ID5737055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp123439 
End bp124461 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content49% 
IMG OID641282262 
Productglycosyl transferase family protein 
Protein accessionYP_001547853 
Protein GI159901607 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.912612 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGTAA GTATCATTAT TGTCAGTTAT AACAGTCAGG ATGATCTCGT TGAGTGTCTC 
GACTCCGTGA TCCAAGCCTG TCCTGATCAA ACACGGTATG AAATTCTTGT TGTTGATAAT
GACTCACAGG ATGCAAGTCG TCTCGTCGTC CAGCAGCAGT ACCCGATGGT TCGACTCCTT
GAAAACAGTA ATACTGGCTA TGCAGGGGGG AATAACTATG GTGCAGCGAT GGCGCGTGGT
GAATACCTTC TCTTTCTTAA TCCCGATACG GTGGTCATGC CAGGTGCCAT TGATGCGCTC
GTAGCCCCCT TCAAGACTGA TCCGACCATT GGTCTTACCA CGGCATGTCT TGTCCACCAT
CAGCACCCCC AATCCATTAA TGCCTGTGGA AATGAGATGC ACTATACCGG GTTAACGTAC
TGTCGAGGTG CCAATCAACC CCGGACTGCC TATCAAACAA GTTCCTATGT TGATGCCGTG
TCAGGCGCTG CATGTGCGAT CCGTCGCAGC CTATTTACCA CGTTGGGGGG GTTTGATCAG
CAGTTTTTTA TGTATGTCGA AGATAGTGAT TTATCGTTAC GGGTGCGGCT CTATGGATTG
CAGTGTTTTT ATGTTGCGGA TGCGGTTATC CAGCATAAGT ATCACATGAA GTATACCGCC
CAAAAAGCAT TTTTGATTGA ACGCAACCGC TATTCGATGC TCATTAAAAA TTTCTCACCG
AGCGTCTTAG GTCGCTTACT GCCAGGACTT CTTCTGGCCG AAGTGATTAC CGGAAGTTAT
TTTTTGCTGC GTGGACCACA GTATTGGAGT ATCAAACCGC GACTCTATCA GCATATCTGG
CGGTATTGGA GGACTACATC GACACCAGCG ACCAGTCCGA TCCAAGAACT GGCCGTTGTC
AAAGCATTAA CCAGTCAGCT TAATTTTCAG TCACTCCATC AGGGCAGGGT TACGCGGTTG
CTTGCAGGGA TGGTCAATCC GCTATTGGGG CTGGCTCATC GATTTGCAGG AGGGTGGGCA
TGA
 
Protein sequence
MDVSIIIVSY NSQDDLVECL DSVIQACPDQ TRYEILVVDN DSQDASRLVV QQQYPMVRLL 
ENSNTGYAGG NNYGAAMARG EYLLFLNPDT VVMPGAIDAL VAPFKTDPTI GLTTACLVHH
QHPQSINACG NEMHYTGLTY CRGANQPRTA YQTSSYVDAV SGAACAIRRS LFTTLGGFDQ
QFFMYVEDSD LSLRVRLYGL QCFYVADAVI QHKYHMKYTA QKAFLIERNR YSMLIKNFSP
SVLGRLLPGL LLAEVITGSY FLLRGPQYWS IKPRLYQHIW RYWRTTSTPA TSPIQELAVV
KALTSQLNFQ SLHQGRVTRL LAGMVNPLLG LAHRFAGGWA