Gene Haur_2408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2408 
Symbol 
ID5734289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3070063 
End bp3071406 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content48% 
IMG OID641279549 
Productglycosyltransferase family 28 protein 
Protein accessionYP_001545176 
Protein GI159898929 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID[TIGR01426] glycosyltransferase, MGT family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGACGA TTGTTATCAG TATGTTTCCC GAAGAAGGCC ATCTTATTCC TAGTTTCAAG 
CTTGGCAAGA GCTTAAAAGC CCAAGGTCAT CAGGTTTATT ATTTAGCCTT GGCCGATTTT
GAGGAGTATA TTCGCAAGCA GGGTTTTGAA TATTTGCCCT TGTTTGCTGA GGATTTTCCC
AAGGGCTTTC GTGCTCAGCA AACCGAGCGA ATTGCGAACA CCCGTGGCCG AGCCTTTTTG
AAAGAGGTTT CACAAACGGC GTTTTATCTC AAGTTGTTTC AAACTGTACA TGAAGATCAA
AATCCAATCA AGCGTAGTTT GTTGGAAATT GGGACAGATT TATGTCTTTT TGATGGATTT
TTGGCCCCGC TGAGCCTTAT GGCGCGGCAT GCTGGGCTGG AAGTAATTAG TCTGAGTATT
AATATTAATC TTCCGCAGGC CGCCAATTAT CCGCCAGTTG TCACCAATAT TGTGCCCGAT
AACACGCCTG CCTCGCTCTC TAAAGCTGGC ATGGCTTGGA AATTTAGTGG GCTAGCAATC
AAAATAACCA ACCTTTTGAT TGGCTACAAT TTCCAGAAAA AGCTAACCGA ACTGGCAACC
CACTTTGGGT TTTCAGCCGA TTTAGTTGTG CCAGCGGCCC TCTTTCCACG CTTGCGACCC
CAACTTGAGA TACCCGAACT CGTGCTTTGC CCGCAAGCCT TTGATTTTCC ACGGCCAAGC
GTTGAGCAAG GGATTTTCTA CTGTGAGCCA TCAATCGATC TTGATCGCCA AGAAGCGGCC
TTTGATTGGT CGCAGATTGA CCCCAACAAG CCGCTGATTT TTTGTACCTT GGGCAGCCAA
AGCCATATCT ACAAGCCAAG TCGGCGCTTT TTTCAAACCG TGATCGACAC CATGCGCAGT
CGCCCCGATT GGCAATTGAT TATGGCGCTC GGCCAGAAAT TTCAAGCCCA TGAATTTGCC
AATGTGCCAG CCAATGTGCA GTTGCTGCAA TGGGCTTCGG TCGAGCAAAT TTTGCCGCGC
ACCAGCGTGA TGATTACCCA TGGCGGGGTT GGCACGATTA AGGAATGTGT CTATTTCAAC
GTGCCAATGG TGGTTTTTCC AGGCAACCGT GATCAACCTG GCTATGCGGC GCGGGTTGTT
TACCATGAGC TAGGTTTGAT GGGTTCGATG GGGAAAGTTT CGGCCCAAGC GCTGGAAACC
ATGCTCAACC AAGTGATTCA AAATCCTAAC TTTAAACAAC GAGTTACGGC AATGGGCGAG
GAATTTCGCG CCTTGGAAGC TGCTAGCCCA GCGCTGGAAT TAATTCAAAG CAAATTACCG
CATACCCGAA CCGTTGCCTC GTAA
 
Protein sequence
MATIVISMFP EEGHLIPSFK LGKSLKAQGH QVYYLALADF EEYIRKQGFE YLPLFAEDFP 
KGFRAQQTER IANTRGRAFL KEVSQTAFYL KLFQTVHEDQ NPIKRSLLEI GTDLCLFDGF
LAPLSLMARH AGLEVISLSI NINLPQAANY PPVVTNIVPD NTPASLSKAG MAWKFSGLAI
KITNLLIGYN FQKKLTELAT HFGFSADLVV PAALFPRLRP QLEIPELVLC PQAFDFPRPS
VEQGIFYCEP SIDLDRQEAA FDWSQIDPNK PLIFCTLGSQ SHIYKPSRRF FQTVIDTMRS
RPDWQLIMAL GQKFQAHEFA NVPANVQLLQ WASVEQILPR TSVMITHGGV GTIKECVYFN
VPMVVFPGNR DQPGYAARVV YHELGLMGSM GKVSAQALET MLNQVIQNPN FKQRVTAMGE
EFRALEAASP ALELIQSKLP HTRTVAS