Gene Haur_3581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3581 
Symbol 
ID5735442 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4502784 
End bp4503920 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content53% 
IMG OID641280730 
Productglycosyl transferase group 1 
Protein accessionYP_001546345 
Protein GI159900098 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACGTTG TTCAGGTGAT CGATTCGCTC TATCGTGGTG GAGCACAACA ATTGCTGGTA 
ACCTTCGCGA TCGAAGCCCA ACGCCGTGGG ATCAAAACCA GCGTTGTTTG TCTCAAGGAT
GAAGATCGCG GCAGCAACTT GGTTGAACGG CTGAATGGTT TGGGCGTTGA AGTGCTGCGT
TTGGCTGCGC CCAAAATGCT GGCCCCTAAA CGAATCTGGC AATTAACCCG TTGGCTGCGG
CGCAATCAGG TCAGTGTGGT GCATACCCAT TTGACCTATG GCAATGTCGT AGGTATTTTG
GCAGCCCGTT TGGCTAATAT TCCTGTGGTG GCGACGATGC ACTTAGCGGG CTTCGATCCA
TCAATTGCCA ATCGCCAACA GCAGTTTGAG GCCCAAGTGG TACAGCGTTT GGCGCAGCAA
ATTATCGCTG TTGGCTATAC CACTCGCGAT GCCTACCAGC CAATTATGCC CAATCGCCAG
CTGCATGTGG TGCATAATGC CGTGGTAGCC GTGCCAGAAA TTAGCCCTGA GCAACGCCAA
ACCACGCGCG AAGCAGTGTT GGGCGACCCA AATTTGACCA TGTTGATCAA CGTTGGGCGT
TTTGCGGCAA TCAAAGATCT GCCAACCTTG ATCGATGCCT TTGCCTTGGT GCATGCTCAA
CATCCCCAGG CGCGGCTCGT TTTGGCGGGC GAAGGCGATC AACGACCCAA AATCGAAGCC
AAAATTAACG CACACCAATT GCCTGCAGTG GTCAATTTGC TTGGCGCACG CGATGATATT
CCAGTGCTAT TGCGTAGCGC CGATTTGTTT GTCAATTCGT CAGCCAACGA AGGATTGCCG
ATCGCCGTGC TCGAAGCCAT GGCCGCAGGC TTGCCGATTA TTGCCACCAA AGTTGGCGAC
GTGCCGCATG TGGTTCGCGA ACAAGCGGGC ATTACGGTTG CGCCGCATGA TCATCAAGCC
TTAGCTGCTG CAATTAATCA AGTTTTGAGC GAGCCAAGCC AGATCCAGGC GATGCAACAG
GCTGCTCAAC AAATTATCGA GCAATACCAT AGCCCTAGCG CGTGGGTTGA TCAATTATTA
AGTTTGTATA CCGCCGCCCA CGAGGGCGTT GACCAGCGCG AGGCTGTCTC GGCATGA
 
Protein sequence
MHVVQVIDSL YRGGAQQLLV TFAIEAQRRG IKTSVVCLKD EDRGSNLVER LNGLGVEVLR 
LAAPKMLAPK RIWQLTRWLR RNQVSVVHTH LTYGNVVGIL AARLANIPVV ATMHLAGFDP
SIANRQQQFE AQVVQRLAQQ IIAVGYTTRD AYQPIMPNRQ LHVVHNAVVA VPEISPEQRQ
TTREAVLGDP NLTMLINVGR FAAIKDLPTL IDAFALVHAQ HPQARLVLAG EGDQRPKIEA
KINAHQLPAV VNLLGARDDI PVLLRSADLF VNSSANEGLP IAVLEAMAAG LPIIATKVGD
VPHVVREQAG ITVAPHDHQA LAAAINQVLS EPSQIQAMQQ AAQQIIEQYH SPSAWVDQLL
SLYTAAHEGV DQREAVSA