Gene Haur_0774 details


Gene Information

Locus tag: Haur_0774
Symbol:
ID: 5732658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 875254
End bp: 876411
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 55%
IMG OID: 641277904
Product: glycosyl transferase family protein
Protein accession: YP_001543550
Protein GI: 159897303
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
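
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the last codon of the unspliced CDS is a stop codon; the function names are illustrative and not part of this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(875254, 876411)
print(length, protein_length(length))  # prints: 1158 385
```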


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00080815
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTCCTG TTGTGATTGG GCTGCTTTTA GGGCTAGCCT CATTGATTTT ATTGGTGCGT 
GATGCCTTGT TGCTCAACAA GATTCCCAAG ATTGAGCCAC GACCAAGCCC TGAACCTATG
CCCAGCGTTG CTGTGTTGGT GCCAGCCCGC AATGAAGCGC AAAATATTGG CCATGTGTTG
CGTGGCATGG CTCAGCAAAC CCGTAGTGAT TGGCAACTAA CCATTCTTGA TGATCATTCA
ACTGATGCCA CGGCTGCGAT TGTAGCCGAT GTTGCGGCGC AGGATCAACG GGTACATTTG
CTGCAAGGCC AGGCATTGCC CGCTGGCTGG ACAGGTAAGT GCTGGGCATG TTGGCAATTG
GCCGAGGCTA GCACTAGCGA ATGGTTGCTG TTTCTTGATG CTGATACCAA GCCGCAGCCT
GAGATGTTGC AACAAGCCCT AGCCTATGCC GAGGCCGAAA AACTCGATCT GCTGACGTTT
TTGCCCTTCT CGGAGCTAGG CAGTTTTTGG GAGCAAACTT TGCTGCCAGC CTTTTTCTCA
ATCATTCAGG CGGCCTATCC GGTCAGCAAA GTTAATACGC CTGGTTCGGG CGTGGTGCTG
GCGAATGGTC AATTTATTTT GGTGCGACGC AGCGCTTACC AACGAGCAGG CGGCCATGCG
GCAGTGCGTG ATCGAGTGCT TGAGGATGTT GAGTTAGCCC AAGCGATTGT GCGGGCTGGC
GGGGTGATGC GGGCGGTGTA CGCTGGTGAG TTGTTGCGCG TGCGAATGTA CACCAAGGGC
AGCGAAGTTC GCGAGGGCTT GGTCAAAAAT GCGATTGCGG GCTTACGTAA TGGTGGCGTG
CGTTCTTCAT GGGCTGGTTT GCGCCAGATT TTGGTTGGAG TTGTGCCATT CGGGCTGGGT
TTGCTAAGCC TATGGGCTTG GCTGCGGCGC TGGACATTCT GGCCAAAAAT CTTATTGAGT
GCGATGGCGG TGCTGCTCAA TGGCTTTGCT TTGTGGAGTT GGGGCCGTTT TATGCAGCAA
TTGTATGGCT TATCGCGCCG TCACGCCCTG CTTTTTCCCT TGGGGATTGT CTGCTATATG
CTGCTGGCGG CTGAAGCGGC TTGGCGGATC TGGTCGGGGC GCGGGGTGAC GTGGAAAGGC
CGCACCTACA AAGAGTAA
 
Protein sequence
MLPVVIGLLL GLASLILLVR DALLLNKIPK IEPRPSPEPM PSVAVLVPAR NEAQNIGHVL 
RGMAQQTRSD WQLTILDDHS TDATAAIVAD VAAQDQRVHL LQGQALPAGW TGKCWACWQL
AEASTSEWLL FLDADTKPQP EMLQQALAYA EAEKLDLLTF LPFSELGSFW EQTLLPAFFS
IIQAAYPVSK VNTPGSGVVL ANGQFILVRR SAYQRAGGHA AVRDRVLEDV ELAQAIVRAG
GVMRAVYAGE LLRVRMYTKG SEVREGLVKN AIAGLRNGGV RSSWAGLRQI LVGVVPFGLG
LLSLWAWLRR WTFWPKILLS AMAVLLNGFA LWSWGRFMQQ LYGLSRRHAL LFPLGIVCYM
LLAAEAAWRI WSGRGVTWKG RTYKE
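
The sequences on this page are printed in space-separated groups of ten, so they need to be joined before any analysis. The following is a minimal sketch, assuming plain-text input copied from the blocks above; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first and last displayed lines of the gene sequence are used here.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups across lines."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(seq.count(base) for base in "GC") / len(seq)


# First and last lines of the gene sequence as displayed above.
first_line = "ATGCTTCCTG TTGTGATTGG GCTGCTTTTA GGGCTAGCCT CATTGATTTT ATTGGTGCGT"
last_line = "CGCACCTACA AAGAGTAA"

print(len(clean_sequence(first_line)))   # 60 bases per full line
print(clean_sequence(first_line)[:3])    # ATG (start codon)
print(clean_sequence(last_line)[-3:])    # TAA (stop codon)
```

Passing the whole cleaned gene sequence to `gc_fraction` should give a value comparable to the GC content listed under Gene Information, although the rounding used by the source database is not stated.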