Gene Hlac_3044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3044 
Symbol 
ID7398894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012028 
Strand
Start bp302825 
End bp304009 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content61% 
IMG OID643706851 
ProductMur ligase family CapB protein 
Protein accessionYP_002564473 
Protein GI222475952 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGGTCGA GCCGCCAAAC AGTGAAAAAG CAGGCACTCG ACGGACTCCG CACTGTCGCC 
AGCGGTGTTG TCGAGGCACT TGGTGCCGGT CCCGCCCACC GTCGGCGGCT CGATGAGATC
GATACCCGAA TCGTCGTCAG TGGTGTCCGC GGGAAGTCAT CGGTGGCCAA CTGGCTGCAC
GAACAGTTCG TCAGCCGCGG CTACGACACC TACACCAAGA TAACCGGATC CGATGCACAG
GTTCGCTACA ACGACACCGT CTCCGAGATC GAGCGCGAAC AGCAGGTCAG ACTGTACGAA
AACGAGCGTG AGCTGGCTCG GTTCGACAGT ATCGACGTCG CAATCGTCGA GAATCAAGGA
ATCAGACCCT ACACGACACG GCTGGTCAAC GAGCAGTTCG TCGACCCCGA TCTGGTGTTT
CTCACCAATG TCCGGGAGGA CCACCTCGAC ACGCTAGGCC GTGATCGCAC CCAGATCGCT
CGGTCGCTCA CCCGTGCAGT CCCTCAGGGG ACATCAGTTG TCTGTGCCGA ACAGTACAAA
CCGCTACGTG AGTACATCCA GACCGAACTC GAGCGTCGGG ATGCACCAGT TACCTTCGTC
GACCCGCCGT CGGGGACCGA GAGCGTGCCA GGCAGTGAGT GTGTGTATGG GCTCAACGAC
GTGCTCGCAG CAGTCGGTGA GCCGCCTGTT CCAACCCAAG AGATCCAGGA CCGAATCGAT
ACGCTTCGTC CGTCGTGGCA GCAGCTTCCT GGTGGTCGGG TGTACAATGC GGCGGCGGTC
AACGACGTCC AGAGCACGGA ACTCGTTCGA CAGTCGCTCG TTGAGGATCG GGAGACAGTA
ATCGAACCAG TGTTGAACCT CCGGTGGGAT CGCCGGGGGC GAACGGTGTC GTTCATCCGC
TATCTCGACG ACCTCTACGA GTCGGGGGCA GTCGAGCAGA CCCACATCGT CGGCGACGAT
CAACAGCTGT TCGAGACGAC CGCCTCCCTC CCGGTCGTTC GCCACGACAC CGAGACTGAA
TCGCCGGCGG CAGTTCTAGA TGACGCGGTA GCCTCGGGTC GGCCAGTGGT GCTGATGGGC
AACACAGTCA CGGCGTTCAT GGAGGCGATG GCCAGGGAAA TCGAGAGCCG AGCAGGGACA
GACAGTGACG CTCCGGAGGC GACAACCGCT CCAGAAACAG CCTGA
 
Protein sequence
MWSSRQTVKK QALDGLRTVA SGVVEALGAG PAHRRRLDEI DTRIVVSGVR GKSSVANWLH 
EQFVSRGYDT YTKITGSDAQ VRYNDTVSEI EREQQVRLYE NERELARFDS IDVAIVENQG
IRPYTTRLVN EQFVDPDLVF LTNVREDHLD TLGRDRTQIA RSLTRAVPQG TSVVCAEQYK
PLREYIQTEL ERRDAPVTFV DPPSGTESVP GSECVYGLND VLAAVGEPPV PTQEIQDRID
TLRPSWQQLP GGRVYNAAAV NDVQSTELVR QSLVEDRETV IEPVLNLRWD RRGRTVSFIR
YLDDLYESGA VEQTHIVGDD QQLFETTASL PVVRHDTETE SPAAVLDDAV ASGRPVVLMG
NTVTAFMEAM AREIESRAGT DSDAPEATTA PETA