Gene Hlac_1083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1083 
Symbol 
ID7400155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1082608 
End bp1083816 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content67% 
IMG OID643708149 
ProductNucleotidyl transferase 
Protein accessionYP_002565748 
Protein GI222479511 
COG category[J] Translation, ribosomal structure and biogenesis
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) 
TIGRFAM ID[TIGR01208] glucose-1-phosphate thymidylylransferase, long form 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.39401 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGGAG TCGTGCTCGC GGCCGGCCGC GGGACCCGCA TGCGACCGCT AACGGACCGT 
CGTCCGAAGC CACTTCTTCC GGTCGGCGAT CGGTCGCTGC TCGAACGGGT GTTCGACACC
GTGGCCGGTG TCGTCGACGA GTTCGTCGTC GTAGTCGGAT ACCGCGGCGA CGCGATCCGC
GACGCGATCG GCGAGTCGTA TCGAGGCTAT CCGGTCCACT ACGTCGAGCA GGCGGAGGCG
TTGGGGACCG CTCACGCCGT CGCGCAGGCC GAGCCCGTCG TCGACGAGGA CTTCCTCGTG
CTCAACGGCG ACGTGGTCGT GGATGCATCG CTCCCCCGCT CCCTTGCCGA CGCCGACGGG
ACGGCAGTCG CGGCCACGGA GGTCGTCGAT CCTCGGGCAT ACGGTGTGCT TTCGACGACT
GAGGACGGCT CGCTCGCCGG GATCGTCGAG AAGCCCGACG ACCCGCCGAC GAATCTCGCG
AACGTCGGCT GTTACGCGTT TCCGCCCGAG GTCTTCGAGT ATATCGATAG AACCCCCGAG
AGCGAACGCG GCGAGTACGA GATCACGACG ACGATCGAGC TCCTCCTCGA CGACGGCCAT
CCTATCGACG TGGCGCCCTA CGAGGGGACG TGGCTCGACG TCGGTCGTCC CTGGGAGCTG
CTGAAAGCCA ACGAACTAGC GCTCACCGAG TTCACGGATG CCGTCGAGAA CGCTGGGACC
GTCGAGGAAG GCGTCCACCT CCACGGCCCG ATCGTCATTG AGGAAGGAGC GCTGGTCCGG
TCTGGAGCGT ATGTCGAGGG GCCGGCGCTG ATTCGCGAGG GCGCGGAAGT CGGGCCGAAC
GCGTACGTTC GCGGGTCGAC GGTGGTCGGT CCGGACGTGC ACGTCGGACA CGGCGTCGAG
GTGAAGAACT CGGTACTCAT GGCCGACGCG TCGGTCGGGC ACCTCTCGTA CGTCGGTGAC
TCCGTGCTGG GTCGCGGCGT GAACTTCGGC GCCGGGACGA ACGTCGCGAA CCTCCGACAC
GACGACGGGA ACGTCCGGAT GACCGTTAAA GGCGACCGCG TCGACACCGG CCGCCGGAAG
CTCGGGGCGA TCGTCGGCGA CGGCGCGAAG ACGGGGATCA ACACGTCGCT GAACGCCGGC
GTCAAACTGG GTGCAGCGGA GACGACCGGT CCCGGAGAGG TTCTGACTCG CGATCGAGTG
TCGGAGTAG
 
Protein sequence
MYGVVLAAGR GTRMRPLTDR RPKPLLPVGD RSLLERVFDT VAGVVDEFVV VVGYRGDAIR 
DAIGESYRGY PVHYVEQAEA LGTAHAVAQA EPVVDEDFLV LNGDVVVDAS LPRSLADADG
TAVAATEVVD PRAYGVLSTT EDGSLAGIVE KPDDPPTNLA NVGCYAFPPE VFEYIDRTPE
SERGEYEITT TIELLLDDGH PIDVAPYEGT WLDVGRPWEL LKANELALTE FTDAVENAGT
VEEGVHLHGP IVIEEGALVR SGAYVEGPAL IREGAEVGPN AYVRGSTVVG PDVHVGHGVE
VKNSVLMADA SVGHLSYVGD SVLGRGVNFG AGTNVANLRH DDGNVRMTVK GDRVDTGRRK
LGAIVGDGAK TGINTSLNAG VKLGAAETTG PGEVLTRDRV SE