Gene Hlac_1280 details


Gene Information

Locus tag: Hlac_1280
Symbol:
ID: 7399375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1290098
End bp: 1291624
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 67%
IMG OID: 643708344
Product: peptidase M32 carboxypeptidase Taq metallopeptidase
Protein accession: YP_002565942
Protein GI: 222479705
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2317] Zn-dependent carboxypeptidase
TIGRFAM ID:
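
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1290098
end_bp = 1291624

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1527
print(protein_length_aa)  # 508
```

Both results agree with the Gene Length (1527 bp) and Protein Length (508 aa) fields.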


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACGG AAGCCGCGGC CGACGACGCA GCCGACGCCC CCGAGGCGTA CGACGCCCTC 
CTGGACCGCG TTCGACGCTG GAACGCGGTC GGGAGCGCCG CCGGCGTGCT CGGGTGGGAC
CAACAGGTGA TGATGCCCGA AGGGGGCACT CCCGCGCGGT CGAAGCAGCT CTCGACGCTC
TCCTCGATCC GCCACGACAT GGTCACCGAC GAGGAGACGG GCGACCTGCT CGACGAACTC
GCCGACGCCG ACCTCACCGA CGAGCGGGCG GCGGTCGTGC GCGAGATCCG CCGCGACTAC
GAACGCGCCG ACGCCGTCCC CGTCGAGCTC GTCGAGGAGA TCTCCGAGAC GGGCACGGAA
GCGCTACAGG CGTGGGAGGA GGCGAAAGCC GAGGACGACT TCGACGCGTT CGCACCCTAC
TTGGAGAAGC ACGTCGAACT GAAACGCGAG TACGCCGAGC ACATCGACCC CGACCGCGAC
CCTTACGAGG TGCTCTTCGA GGACTACGAG CCGTGCCTCT CGATGGAGCG CGCGGAGTCG
ATCCTCGAGG AGCTCCGCGA GACGCTCGTC CCCATGATCG ACGCGATCCG CGAGTCCGAC
GCCGACCTCG CCGTCGACAC CTTCGAGGGA ACCTTCCCAG AAGCGGAGCA GGAGGCGCTG
GCGCGCGAGA CGCTCGAACT GGTCGGCTAC GACTTCGACC GCGGCCGGCT CGACGTCTCC
TCGCACCCGT TCACCGCGGG AAACCAGTTC GACTGCCGGG TGACCACCCG ATTCGACGAG
TCCGACCCGC TCGGCGCGAT CGGCTCGACG ATCCACGAGT TCGGCCACGC GCAGTACAAC
CTCGGGCTCC CGCAGGAGCA GTTCGGGACC CCGCTCGGGG AATCCCGTGA TCTGTCGGTC
CACGAGTCGC AGTCGCGGCT CTGGGAGAAC CACGTTGGTC GCAGCCGGGC GTTCTGGGAG
CTGTTCCTCC CGACCTTCCA AGAGCACTTC CCGGAGACCG ACGACGCCAC CGTCGAGGAC
GCCTATCAGG CGTTCAATCA GGTCCACGAG GACAACCTCA TTCGCGTCGA GGCCGACGAA
CTCACCTATC ACCTCCACAT CGTCGTCCGG TTCGAGATCG AACGCGACCT GGTCCGCGGC
GACCTCGCGA TTGAGGACGT ACCCGAGGCC TGGAACGACA AGTACGAGGA GTACCTCGGA
ATCCGCCCCG ACAACGACGC CGAGGGCTGC CTGCAGGACA TCCACTGGAG CCACGGCAAC
TTCGGCTACT TCCCCACTTA CTCGCTCGGC TCCGTGATGG CCGCCCAGCT GTTCGCGGCC
GCCGAATCGG AGATCGACGA CCTCGACGAC CAGATCGCCG CGGGCGAGTT CGACGACCTT
CGGGAGTGGC TCGGTGAGAA CGTCCATCAG CACGGCTCCC GCTACGAGAC GAACGAGCTG
GTGAAACGCG CCACCGGCGA GGACTTCTCG GCGGACGCCT TTACCGACTA CGTCGAGGAG
AAGTACGGCG AGCTGTACGG TATATAA
 
Protein sequence
MATEAAADDA ADAPEAYDAL LDRVRRWNAV GSAAGVLGWD QQVMMPEGGT PARSKQLSTL 
SSIRHDMVTD EETGDLLDEL ADADLTDERA AVVREIRRDY ERADAVPVEL VEEISETGTE
ALQAWEEAKA EDDFDAFAPY LEKHVELKRE YAEHIDPDRD PYEVLFEDYE PCLSMERAES
ILEELRETLV PMIDAIRESD ADLAVDTFEG TFPEAEQEAL ARETLELVGY DFDRGRLDVS
SHPFTAGNQF DCRVTTRFDE SDPLGAIGST IHEFGHAQYN LGLPQEQFGT PLGESRDLSV
HESQSRLWEN HVGRSRAFWE LFLPTFQEHF PETDDATVED AYQAFNQVHE DNLIRVEADE
LTYHLHIVVR FEIERDLVRG DLAIEDVPEA WNDKYEEYLG IRPDNDAEGC LQDIHWSHGN
FGYFPTYSLG SVMAAQLFAA AESEIDDLDD QIAAGEFDDL REWLGENVHQ HGSRYETNEL
VKRATGEDFS ADAFTDYVEE KYGELYGI
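
The sequences above are printed in blocks of 10 characters, 60 per line, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, together with a GC-fraction helper that could be used to compare against the GC content field. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any database tool, and only the last line of the gene sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Sample input: the final line of the gene sequence as printed above.
last_line = "AAGTACGGCG AGCTGTACGG TATATAA"
tail = clean_sequence(last_line)

print(len(tail))   # 27
print(tail[-3:])   # TAA
```

Passing the full gene sequence through `clean_sequence` should give a string whose length can be checked against the Gene Length field, and `gc_fraction` applied to it can be compared with the reported GC content.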