Gene Hhal_2114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2114 
Symbol 
ID4711267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2319187 
End bp2320443 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content69% 
IMG OID639856588 
ProductUDP-N-acetylglucosamine 1-carboxyvinyltransferase 
Protein accessionYP_001003680 
Protein GI121998893 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase 
TIGRFAM ID[TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCGAC TGCTCATCCG GGGCGGTAAC CCCCTGCGTG GCGATATCCG TATCTCCGGG 
GCCAAGAATG CGGCCCTGCC GGTCATGGCG GCGACCCTGC TGGCCGACGG CCCGACCACC
GTCGGCAACA TCCCGCATCT CCACGACGTA ACCACCACCA TGGAGCTGCT CGGGCGCATG
GGGGTGGAGC TCACCGTGCA CGAGGGTATG GAGGTCGAGG TCAACACCGC GACCATCCAC
AGCTTCCGGG CGCCCTACGA ATTGGTGAAG ACCATGCGCG CCTCGATCCT GGTGCTCGGG
CCGCTGCTGG CGCGCTTCGG CCAGGCCGAG GTGTCGCTGC CCGGGGGCTG CGCCATCGGC
TCGCGGCCGG TCAACATCCA CGTGGACGGG TTGCGGGCCA TGGGCGCCGA GATCGAGGTC
CGCGACGGGT ACATCAAGGG CCGCGCCGAC CGGCTCCAGG GGGCGCACAT CCGCATGGAT
GTCTCGACGG TGACCGGAAC CGAAAACCTG ATGATGGCGG CCGCCCTGGC GCGCGGAACC
ACGGTACTCG AGAATGCCGC CCGCGAGCCC GAGGTCATCA ATCTGGCCGA CTGCATCAAC
GCCATGGGGG GGCACGTCCA GGGGGCCGGC ACCTCGACGA TCACCATCGA GGGGGTCGAC
ACGCTGCGCG GCGTTCACCA TCGAGTGCTG CCCGATCGCA TTGAGACCGG CACCTACCTG
GTCGCTGCGG CCATGACCGG GGGAGAGGTG CGGCTCAAGG ACACGGCGCC GGAGCTGGTC
GAGTCGGTGC TCGGCAAGCT GCGCGAGAGC GGGGCAGAGG TGAGCGCCGG TCGCGACTGG
GTGACCCTGC GCATGGAGGG CCGCCCGCGC GCCGTGGACC TGGAGACGGC GCCGTACCCG
GGCTTCCCCA CGGACATGCA GGCCCAGTTC TGTGCCCTGA ACGCCATCTC CACCGGGGAG
GGGACGGTGA CCGAGACCGT CTTCGAGAAC CGGTTCATGC ACTGCCTGGA GATGCAGCGC
ATGGGTGCCG ACATCCAGAT CGAAGGGGCC CGGGCGCGCA TCCGCGGCGT CGAGAAGCTG
ACCGCGGCGC CGGTGATCGC CACCGATCTG CGCGCCTCGG CGAGTCTGGT GCTGGCCGGG
CTGGTGGCCG AGGGTGAGAC CCGGGTGGAC CGGATCTACC ACATCGACCG CGGCTACGAG
TGCATCGAGG AGAAGCTCGC CCAGCTCGGT GCCGATATCC AGCGCGTCCC CGACTGA
 
Protein sequence
MERLLIRGGN PLRGDIRISG AKNAALPVMA ATLLADGPTT VGNIPHLHDV TTTMELLGRM 
GVELTVHEGM EVEVNTATIH SFRAPYELVK TMRASILVLG PLLARFGQAE VSLPGGCAIG
SRPVNIHVDG LRAMGAEIEV RDGYIKGRAD RLQGAHIRMD VSTVTGTENL MMAAALARGT
TVLENAAREP EVINLADCIN AMGGHVQGAG TSTITIEGVD TLRGVHHRVL PDRIETGTYL
VAAAMTGGEV RLKDTAPELV ESVLGKLRES GAEVSAGRDW VTLRMEGRPR AVDLETAPYP
GFPTDMQAQF CALNAISTGE GTVTETVFEN RFMHCLEMQR MGADIQIEGA RARIRGVEKL
TAAPVIATDL RASASLVLAG LVAEGETRVD RIYHIDRGYE CIEEKLAQLG ADIQRVPD