Gene Haur_4604 details


Gene Information

Locus tag: Haur_4604
Symbol:
ID: 5736449
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5887572
End bp: 5888687
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 53%
IMG OID: 641281766
Product: glycosyl transferase group 1
Protein accession: YP_001547363
Protein GI: 159901116
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
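
The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions agree with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(5887572, 5888687)
print(length, "bp")                      # gene length
print(protein_length_aa(length), "aa")   # protein length
```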


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000112819
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGATTG CAATTGATGC TAGTCGTTTG GCGGTTGGGC AGCGCACTGG CACCGAATCA 
TATACAACTG AATTAGTAAG AGCGCTGGCT CAAACTGACC GTACCAACCA CTATTGTTTA
TATGTGAATC AACTCCCCGC AGCGTTGCCC CCACTAGGCC GCAATTGGCG GATCAAGCCG
ATTCCTGCGC CGCGTTTATG GACGCACTTG CGGCTTGGCC CAACATGGCA GATCGATCGG
CCAGATGTGG CATTTGTGCC AGCCCATGTT TTGCCAAGTT TGCCGCCGCG TCGGAGCGTT
GTGACGATTC ACGATTTGGG CTACGAACAC CACCCTGAAT CGCACCCCGC CCGCCAACGG
CTGTATTTGC GCTACTCAAC CTTGTGGAGC GCTCGTATGG CTAGCCAAAT TATTGCGATC
TCCGAAGCCA CCAAGCGCGA TCTGTTGCAC TACACAGGCA TTGCCGCCGA AAAAATTAGC
GTAATCTACC ACGGCGTGCA CGAGCGTTTT TATCCCCATT CGAGCGAGCA AACCAAGGCG
ACTGCCGCCA AATATGGCTT ACACGGCGAG TATTTATTAT TTATCAGCAC AATTCAGCCA
CGCAAAAATT TGGTGCGCTT GATCGAAGCC TATGCCCAAG CCCGCCAACG CTGCCCTGAT
TTGCCGATTT TGGCCTTGGG CGGCAAAACT GGCTGGCTAA CCGAACAAAT TACCCAACAA
GCTCAACAGT TAGGAATCAG CGAGCATGTG GCCTTTTTAG GCTATGTGGC CGACGACGAT
TTACCAGCGC TGCTCAGTGG TGCGACAATC TATCTATTGC CATCGCTCTA CGAAGGCTTT
GGCATGACCG TCTTGGAAGC CATGTCCAGT GGCGTTCCAG TGATTACCAG CAATGTAAGC
AGCCTACCTG AGGTTGCTGG CGATGCAGCC TTGTTGGTTG AGCCAAGCCA AACCGCTACA
ATTGCCGCAG CGATTGTTGA GCTTTGGCAA AACCCACAGC AACGCCACGA TTTTGCTCAA
CGCGGCTTAG CATGGGCCAA ACAATGGACA TGGCAGCGCT GTGCTGAACA AACTTTAGCA
GTTCTTACAA CGGTTGGGCA TCATGGCTCA TTCTAA
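
The GC content field can be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the total sequence length, and that the spaces and line breaks are layout only. The helper name `gc_content` is hypothetical and not part of any tool used to build this record.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for layout."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Usage: paste the full gene sequence (blocks and line breaks included)
# into a string and pass it in; only a short made-up example is shown here.
print(gc_content("ATGC ATGC\nGGCC"))
```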
 
Protein sequence
MQIAIDASRL AVGQRTGTES YTTELVRALA QTDRTNHYCL YVNQLPAALP PLGRNWRIKP 
IPAPRLWTHL RLGPTWQIDR PDVAFVPAHV LPSLPPRRSV VTIHDLGYEH HPESHPARQR
LYLRYSTLWS ARMASQIIAI SEATKRDLLH YTGIAAEKIS VIYHGVHERF YPHSSEQTKA
TAAKYGLHGE YLLFISTIQP RKNLVRLIEA YAQARQRCPD LPILALGGKT GWLTEQITQQ
AQQLGISEHV AFLGYVADDD LPALLSGATI YLLPSLYEGF GMTVLEAMSS GVPVITSNVS
SLPEVAGDAA LLVEPSQTAT IAAAIVELWQ NPQQRHDFAQ RGLAWAKQWT WQRCAEQTLA
VLTTVGHHGS F
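
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below rebuilds that translation in plain Python. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in the permitted start codons), and it assumes the sequence is given in the coding orientation starting at the first base of the start codon. An alternative start codon such as GTG or TTG would need its first residue set to M, which this sketch does not do. This gene starts with ATG, so that case does not arise here. The names are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop layout spaces and line breaks
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGCAGATTG CAATTGATGC TAGTCGTTTG"))
```

Applied to the first 30 bases of the gene sequence, this yields MQIAIDASRL, matching the first block of the protein sequence.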