Gene Hore_21240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_21240 
Symbol 
ID7313362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2313561 
End bp2314931 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content42% 
IMG OID643612576 
ProductUDP-N-acetylglucosamine pyrophosphorylase 
Protein accessionYP_002509864 
Protein GI220932956 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000000272523 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCTC AGGTATTAAC TATAATTCTC GCCGCCGGCA AAGGAACAAG AATGAAATCC 
GGGCTGGCTA AAGTTTTGCA TCCTGTTGCT GGAAAACCCA TGATCAGCCA CGTGATTAAT
AGTGCTTCTC CAATCAGCTC TTCTGTTGTG GTGATAGTGG GGTACCAGGG TGATAAAGTA
AAAGAGACCC TGGGTACAGG CTATACATAC GTTCGCCAGG AAGAACAGCT GGGGACCGGC
CATGCGGTTT TACAGGCTAA AAAGTTAATC AAAAAACATC AGGGGCAGGT CCTGATTCTC
TGTGGGGATA CCCCGTTATT AAGAGAAAAA ACTTTAAGTG AACTGGTAGA TGCCCAGAGA
GAAACCGGGG CTGGAGTGGC TGTATTAACA GCCGATATAG ATAACCCCCG GGGTTATGGG
AGAATTATCA GGAATGAGGC CGGAAACCAG ATAATAAAGA TAGTTGAGGA CTCTGATGCC
AGTGATGAAG AAAGACTGGT CAATGAAATT AACAGTGGAG TTTATTGTTT TGACAGCAAC
CAGCTCAGTG AGGCTCTGGA AAACTTAACA AATGATAATG CCCAGGGTGA GTATTATCTT
ACTGATACTA TTGCTTATTT GAGAAATAAA GGGGAGGTAG TAGTCCCCGT TAAAGTTGAT
GATTCCCGGG AGATTATTGG CGTAAATGAC CGGAGAAACC TGGCTAGGGC TGAGAGGGTT
TTAAGAAACA GGATTATAAA TTATCACCTG GCCAATGGTG TCAGTATTAT TGACCCTGAC
ACCACTTATA TTGATAGTAC TGTTGAAATT GGACAGGATA GTGTTATTTA TCCTTTTACT
TATATAGAAG GAAGAACACG AATTGGCTCA GAAGTGGTTG TCGGCCCCCA TTCACATTTA
ATAAATGCTG AAATTGGAGA TAGAAGTAAA CTCCTCGATT CTACTGTTAT TAAAGATAGC
AAAATAGGGG AAGATACCAA TATTGGTCCC TTTGCCTATA TAAGGCCTGG CTGTCAGATT
GCCAGTGGGG TTAAGGTTGG GGATTTTGTT GAATTAAAAA AAGCAAAGAT CGGGGAAAAT
ACCAAAGTTC CTCATTTGAG TTATGTAGGG GATGCTGAAA TAGGGGAAAA TAGTAATATT
GGAGCCGGTA CAATATTTGC AAATTATGAT GGGAAGAAAA AGCATAAAAC AAAAGTGGGT
AATAATGCCT TCATCGGGAG TAATACAACC TTGATTGCTC CAGTAACGGT TGGAAACAGG
GGAAAAACAG GGGCCGGTGC TGTTGTGACT AAAGATGTCC CCGGAGGGGT AACAGTGGTC
GGTGTACCTG CTCGTAAATT TAAAAAGGAT AACATTGAGG GGGATAAATA A
 
Protein sequence
MESQVLTIIL AAGKGTRMKS GLAKVLHPVA GKPMISHVIN SASPISSSVV VIVGYQGDKV 
KETLGTGYTY VRQEEQLGTG HAVLQAKKLI KKHQGQVLIL CGDTPLLREK TLSELVDAQR
ETGAGVAVLT ADIDNPRGYG RIIRNEAGNQ IIKIVEDSDA SDEERLVNEI NSGVYCFDSN
QLSEALENLT NDNAQGEYYL TDTIAYLRNK GEVVVPVKVD DSREIIGVND RRNLARAERV
LRNRIINYHL ANGVSIIDPD TTYIDSTVEI GQDSVIYPFT YIEGRTRIGS EVVVGPHSHL
INAEIGDRSK LLDSTVIKDS KIGEDTNIGP FAYIRPGCQI ASGVKVGDFV ELKKAKIGEN
TKVPHLSYVG DAEIGENSNI GAGTIFANYD GKKKHKTKVG NNAFIGSNTT LIAPVTVGNR
GKTGAGAVVT KDVPGGVTVV GVPARKFKKD NIEGDK