Gene Hlac_0670 details


Gene Information

Locus tag: Hlac_0670
Symbol:
ID: 7401805
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 686354
End bp: 687715
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 72%
IMG OID: 643707736
Product: diaminopimelate decarboxylase
Protein accession: YP_002565342
Protein GI: 222479105
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0019] Diaminopimelate decarboxylase
TIGRFAM ID: [TIGR01048] diaminopimelate decarboxylase
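
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; neither assumption is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 686354
end_bp = 687715
gene_length_bp = 1362
protein_length_aa = 453

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N // 3 codons; the last one is assumed to be a stop codon
# and therefore contributes no amino acid.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print(span, codons, remainder, coding_codons)
```

Under these assumptions the span equals the stated gene length, and the codon count minus the stop codon equals the stated protein length.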


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.00590114
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGCGACA ACGACGCGGC CGCTGAATCC CCTCTGAGTC CCTCGGTCCG GCGCGTGAGC 
GACTGGGACG CCGACCGTCT CGCTGGCCTC GCGGTCGAAC ACGGCACCCC CCTCTACGTG
CAGGACCTCG ACCGCGTCCG CGAGAACAGC GAGCGCCTGC GCGAGGCCTT TCCGGACGCC
GACGTGCGGT ACGCGGTGAA GGCCCACACC GGTCGGGCCG TGCTGGAGGC CGTCCGCGAG
GCGGGCCTCG ACGCCGAGTG TGCCTCTGCC GGCGAGGTGG ACCGCGCGCT CGCAGCCGGC
TTCGGCGGCG ATCGCCTCCA CTACACCGCG GTCAACCCGC CCGCCCGCGA CCTCGATTAC
GTCGTCGGCG TCGCCGAGGC GGAGCCGGAC CTCACGATCA CTGTCGGCGC GGTCGACACC
CTCGACCGGC TGGCCGAGCG AGGCTACGAC GGGCGGGTCT GCGTCCGAGT GAACCCCGGC
GTCGGCGCGG GCCACCACGA GAAGGTCCGG ACCGGCGGCG CGGCGAAGTT CGGGATCCCA
TACGACCGGG CCGCCGAGGC AACGCGAGAC GCGGCCGAGC GCTTCGACGT GGTCGGAATC
CACGCGCACG CTGGCTCCGG GATCGACCCG GACCAGCTCG ACAGCCACCG CGAGATGGTG
ACACGGATGG GCGGACTGGC GCGGGATCTG ACCGACCCGG ACGAGGGCGG CGTGGCCCCC
GTCGACATCG AGTACGTCGA TGTCGGCGGC GGCTTCGGCG TCCCTTATAA AGAGGACGCG
CCGGCGCTCG ACCTACCGGC GGTCGCCGAG GCGACGCGGG AGGCGGTCGC GCCGCTCCCT
GCGGGTGTCG ATCTCGCGAT CGAGCCCGGG CGGTACGTCG TCGCCGACGC GGGCGTCCTC
CTGACGCGCG TCAACACCGT GAAGCCAACG CCGGACGAGC GCGTCGTCGG CGTCGACGCC
GGGATGACGG ACCTGCTGCG CCCGGCGATG TACGACGCCT ACCACCCGAT CCGGAACCTC
GGGGGCGGGC GAGAGAGCGA CGGGAGCGAT GCCCCCGCCA TCGACGACCG GTCGGCAACC
CCTGTCACGG TCGCCGGACC TATCTGTGAG ACCGGCGATA CGCTCTGTAC AGACCGAGCG
CTCGCCGACC CGGTACGCGG GGACCTCCTC GCGGTCGGGA TCGCGGGCGC GTACGGATAC
GAAATGGCAA CCCAATACAA CTCACGACCG CGGCCGCCGG AGGTCGCACT CGACGACGGG
ACTGCAGCGA TCGTCCGCCG CCGGGAGACA CTCGACGACC TGACGACGGT CGAGCGGGAC
GCGAACCGGA ACCGAGCCGA CCGAACGGAG GCGGGCCGAT GA
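
The GC content reported in the Gene Information section can be recomputed from the gene sequence. The sketch below is one plain way to do that, assuming the sequence is supplied in the 10-base, space-separated layout shown above; `gc_content` is a hypothetical helper, not part of the source database. Only the first row is included here for brevity, so the printed value describes those 60 bases rather than the whole gene.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above; paste all rows to check the full gene.
first_row = "ATGAGCGACA ACGACGCGGC CGCTGAATCC CCTCTGAGTC CCTCGGTCCG GCGCGTGAGC"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```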
 
Protein sequence
MSDNDAAAES PLSPSVRRVS DWDADRLAGL AVEHGTPLYV QDLDRVRENS ERLREAFPDA 
DVRYAVKAHT GRAVLEAVRE AGLDAECASA GEVDRALAAG FGGDRLHYTA VNPPARDLDY
VVGVAEAEPD LTITVGAVDT LDRLAERGYD GRVCVRVNPG VGAGHHEKVR TGGAAKFGIP
YDRAAEATRD AAERFDVVGI HAHAGSGIDP DQLDSHREMV TRMGGLARDL TDPDEGGVAP
VDIEYVDVGG GFGVPYKEDA PALDLPAVAE ATREAVAPLP AGVDLAIEPG RYVVADAGVL
LTRVNTVKPT PDERVVGVDA GMTDLLRPAM YDAYHPIRNL GGGRESDGSD APAIDDRSAT
PVTVAGPICE TGDTLCTDRA LADPVRGDLL AVGIAGAYGY EMATQYNSRP RPPEVALDDG
TAAIVRRRET LDDLTTVERD ANRNRADRTE AGR
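
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ in which codons may act as start codons). `CODON_TABLE` and `translate` are illustrative names, and only the first and last rows of the gene sequence are used as inputs.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 assigns amino acids exactly as the standard code does.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding region
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence above (both start in frame).
first_row = "ATGAGCGACA ACGACGCGGC CGCTGAATCC CCTCTGAGTC CCTCGGTCCG GCGCGTGAGC"
last_row = "GCGAACCGGA ACCGAGCCGA CCGAACGGAG GCGGGCCGAT GA"

print(translate(first_row))  # MSDNDAAAESPLSPSVRRVS
print(translate(last_row))   # ANRNRADRTEAGR
```

Both outputs match the corresponding ends of the protein sequence listed above; the last row ends in the stop codon TGA, which is why the protein is one residue shorter than the codon count.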