Gene Haur_0775 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0775 
Symbol 
ID5732659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp876412 
End bp877449 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content54% 
IMG OID641277905 
Productglycosyl transferase group 1 
Protein accessionYP_001543551 
Protein GI159897304 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0449927 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGGA TTCTTGTTTG TACAGCCCAA GTGCCTTTTG CCCGTGGTGG AGCCGAGTTG 
TTAGCCGAAG GCCTCCTGCA AGCCTTGCGC AAGGCAGGCC ATGAAGCCGA TTTAGTCGCC
TTGCCCTTTA CTCGCACACC ACATCGCGAG TTGCTCAATA GCGCTTTGGC CTGGCGCATG
CTCGATCTCA GCCAGGTTGA AGATCGACCA GTCGATCAAG TAATATGTAC CAAATTTCCT
TCATATGCGG TGGCCCACCC CAAAAAAGTC GTTTGGTTGG TACATCAACA TCGGCAACTC
TACGATTGGC GCGGCACAAA CTGGAGCGAT TGGGGCAGTC AACCTGATGA TGATCAACTC
GCTCGCAGCC TGACGCGGCT CGATCAACAA GCCTTAGCCG AAGCCAAACG CCGCTTTAGC
ATCTCCAAAA TTGTCAGCCA ACGCTTGCAA CGCTTCAATG GACTCGCCAG CACCCCGCTG
TATCCACCGT CGATTTATAG CGGGCGCTTA CGTCAAGGCC GCTACGAACC GTATATTCTC
AGCATTTCGC GGCTTGACCC CGCCAAACGA CTCGATTTAT TGCTGCATGC CCTAACGCAT
ACCGAACAAC CAGTTAAGGC GATTATCGGC GGGCGCGGCC CAGCTTTGGT AGAACTCCAA
GGGCTAACCA AGCAACTTGG GCTTGAACAA CGGGTTGAGT TTCGCGGCTG GATGGATGAT
CAAACGCTGA TCGATGTATA TGCCGATGCC CGCGCCGTGT TCTATGCCCC GATCGACGAG
GATTTTGGCT TTGCCACGAT CGAAGCGCTT GAGGCGGCCA AGCCAGTGCT GACCGCCCAA
GATTCGGGCA CAGTTTTAGA ATTTATTCAC GATGGCACAA CCGGCTTTGT TGCGCCAGCC
GAACCGCGGG CCATGGCCGC CCGCCTCGAC GCATTGTGGG CTTCTGCCGA TTTAGCAGCC
CAACTTGGCA GCAATGGACC AGCGATGGTT GCCAACATTC GCTGGGAACA TGTCGTCAAT
CAATTAGTTT TAGCTTAA
 
Protein sequence
MKRILVCTAQ VPFARGGAEL LAEGLLQALR KAGHEADLVA LPFTRTPHRE LLNSALAWRM 
LDLSQVEDRP VDQVICTKFP SYAVAHPKKV VWLVHQHRQL YDWRGTNWSD WGSQPDDDQL
ARSLTRLDQQ ALAEAKRRFS ISKIVSQRLQ RFNGLASTPL YPPSIYSGRL RQGRYEPYIL
SISRLDPAKR LDLLLHALTH TEQPVKAIIG GRGPALVELQ GLTKQLGLEQ RVEFRGWMDD
QTLIDVYADA RAVFYAPIDE DFGFATIEAL EAAKPVLTAQ DSGTVLEFIH DGTTGFVAPA
EPRAMAARLD ALWASADLAA QLGSNGPAMV ANIRWEHVVN QLVLA