Gene Haur_2249 details


Gene Information

Locus tag: Haur_2249
Symbol:
ID: 5734136
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2866610
End bp: 2867881
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 52%
IMG OID: 641279390
Product: glycosyl transferase group 1
Protein accession: YP_001545017
Protein GI: 159898770
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID: [TIGR03449] UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase
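
The length fields in this record are related by simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (an assumption, though it is consistent with the listed 1272 bp) and that the final codon is a stop codon that is not translated. The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record.
start_bp = 2866610
end_bp = 2867881

# Assumption: both coordinates are inclusive, so add 1 to the difference.
gene_length_bp = end_bp - start_bp + 1

# Each codon spans 3 bases; the last codon is assumed to be the stop codon.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1272
print(protein_length_aa)   # 423
```

Both values agree with the Gene Length and Protein Length fields.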


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.143292
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCGCCGCG TTGCAATGTT GAGTGTCCAT TCGAGTCCTC TAGCTGCTTT GGGTGGTAAA 
GAAGCTGGTG GTATGAATGT TTATGTTCGT GAATTAAGCC GTGAGTTGGG TCGCCAAGGG
GTGGCGGTCG ATATTTATAC TCGGTCGCAA GACCCTCACA CACCGTTGAT TACCAATCTT
GCGCCAAATG TGCGGGTGTT TGCGGTGCGT GTCGGCCCGG CTGCGCCCTA CGATAAGAAT
TGGGTTTTGG ATTATTTACC AGAATTTGTC CATCGGATTC GCTGTGTTGC TGATGGCGAA
GATATTCATT ACGATGTGAT TCATAGCCAT TATTGGCTCT CTGGCGTTGC CGCGCTAGAG
TTACGCCAAG CTTGGGGTAC ACCAGTTATT CATATGTTTC ACACCTTGGG GGCGATGAAA
AATACAATTG CCCGTGGCGA TGAAGCTGAA ACTGAACAAC GAATCGCGAT CGAACGCATG
CTGTTGCACG AAGTTGATCG AGTTGTTGCG GCTACGCCGC TTGATCGTGC TCAGATGCTC
GAACACTATG ACGCTGAGTG TGAGCGGATT GTGGTTGTGC CGTGTGGGGT TGATGTTGAG
CATTTCCACC CGATTGCGCA TCAAATTGCC CGCAATGAAT TAGGCGTGCC GCCGCATCCT
CATCGTATGT TGCTGTTTGT CGGGCGGATC GAGCCACTCA AGGGAATTGA TACGCTGCTA
CGTTCGATGG CCTTGTTGGC TGAGCAACAG CCCTCGTTAC GTGGCGATAT TTGTTTGGCG
ATTATTGGCG GCGATCGGCG CGAAACCCCA GATCAATGGA GCAGCGAAAT GCGGCGTTTG
CGGCGTTTGC AGGGCGAATT AGGCATAGGC CATTTGGTCA CCTTCCAAGG ATCGCAAGAT
CAGCGCAAAT TGCCTTTGTT TTATAGTGCT GCCGATATGG TGGTGGTGCC ATCGCACTAC
GAATCGTTTG GCATGGTGGC GCTCGAAGCC ATGGCCTGTG GCACGCCAGT GATAGCTTCC
AACGTTGGTG GTTTGCGCTA CACCGTGCGT GACGGCGAAA CGGGCCTACT GGTGCCGCGC
GAAGATCCCG AAGCTTTAGC CGAAAAAATT AGTTTGCTCT TGAATGATGA GCCTTTGCGT
TTACAATTAG GCCGCAACGG CGTGCAAGCA GCCCAACGCT ATAGCTGGGC CGCAGTTGCC
CACGATATTC GTGAGTTGTA TGATCATGTT GTGTGTGGCG AACCATATGC CGATGTGGTT
GGAGCCATGT AG
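
The GC content field can be recomputed from a sequence written in this blocks-of-ten layout. The sketch below is one possible way to do it: it strips the grouping whitespace and reports the percentage of G and C bases. The helper names `clean_sequence` and `gc_percent` are hypothetical. Only the first line of the gene sequence is embedded, so the printed value describes that 60 bp sample and not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases, and uppercase."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "GTGCGCCGCG TTGCAATGTT GAGTGTCCAT TCGAGTCCTC TAGCTGCTTT GGGTGGTAAA"
sample = clean_sequence(first_line)

print(len(sample))                   # number of bases in the sample
print(round(gc_percent(sample), 1))  # GC percentage of the sample only
```

To check the record's 52% figure, pass the complete gene sequence to `clean_sequence` instead of the single line used here.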
 
Protein sequence
MRRVAMLSVH SSPLAALGGK EAGGMNVYVR ELSRELGRQG VAVDIYTRSQ DPHTPLITNL 
APNVRVFAVR VGPAAPYDKN WVLDYLPEFV HRIRCVADGE DIHYDVIHSH YWLSGVAALE
LRQAWGTPVI HMFHTLGAMK NTIARGDEAE TEQRIAIERM LLHEVDRVVA ATPLDRAQML
EHYDAECERI VVVPCGVDVE HFHPIAHQIA RNELGVPPHP HRMLLFVGRI EPLKGIDTLL
RSMALLAEQQ PSLRGDICLA IIGGDRRETP DQWSSEMRRL RRLQGELGIG HLVTFQGSQD
QRKLPLFYSA ADMVVVPSHY ESFGMVALEA MACGTPVIAS NVGGLRYTVR DGETGLLVPR
EDPEALAEKI SLLLNDEPLR LQLGRNGVQA AQRYSWAAVA HDIRELYDHV VCGEPYADVV
GAM
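
The protein sequence begins with M even though the gene sequence begins with GTG. Translation table 11 (the bacterial, archaeal and plant plastid code) uses the standard codon assignments but accepts GTG as an initiation codon, which is read as methionine at the start of a coding sequence. The sketch below illustrates this on the first 60 bases of the gene. It is a simplified illustration and not the database's own pipeline: the function name `translate_cds` is hypothetical, and the code assumes the first codon is a valid initiator without checking it.

```python
# Standard codon assignments (shared by translation tables 1 and 11),
# listed in TCAG order for the first, second and third codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is an initiation codon, so it is always
    written as M (this covers alternative starts such as GTG).
    """
    dna = "".join(dna.split()).upper()  # drop grouping spaces and newlines
    usable = len(dna) - len(dna) % 3    # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        residue = CODON_TABLE[dna[index:index + 3]]
        if residue == "*":              # stop codon ends translation
            break
        protein.append("M" if index == 0 else residue)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "GTGCGCCGCG TTGCAATGTT GAGTGTCCAT TCGAGTCCTC TAGCTGCTTT GGGTGGTAAA"
print(translate_cds(first_line))  # MRRVAMLSVHSSPLAALGGK
```

The result matches the first 20 residues of the protein sequence listed above.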