Gene Hlac_0653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0653 
Symbol 
ID7401788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp669661 
End bp670977 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content70% 
IMG OID643707719 
Productpeptidase M28 
Protein accessionYP_002565325 
Protein GI222479088 
COG category[R] General function prediction only 
COG ID[COG2234] Predicted aminopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.602384 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.117412 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACT GGATCGGCGA CACCTTCACG AGCGACGCCG GCTGGGATCA CCTCGAATCA 
CTCGTCGATA TCGATCACCG CATGGCCGGC TCCGACGGCG AGCGCGCCGG GCTCGAACTG
ACCCGCGACG CGCTGTCCGA CGCCGGCGCG CGTGACGCGC GGATCGAGGA ATTCGAGATT
CAGGGCTGGG AGCGCGGCGA CAGCGAGATC CGACGCGACG AAGAGGTTGT CGCGAGCGGG
CAAAACGCCT GCATCGCGCT CCCGCGGAGC CCGAGCGGCG AGGCGACCGG CGAGTTCGTC
GATCTGGGGT ACGGCGTCCC CGAGGACTTC GACGACGATC TGACGGGGAA GGTCGTGATG
GTCTCGTCGG ACACCCCCGA CTCGGTCGAC CGGTTCATCC ACCGCCGCGA AAAGTACTAC
CACGCGGTGG AGGCGGGTGC CGCCGCCTTC GTCTTCGCGA ACCACGTCGA GGGGACGCTG
CCGCCGACCG GGAGCGTCGG CACCGCGGAT GCGCCGATCG GCGATATCCC GGCGGTCGGC
GTCTCGAAGG AGACCGGCGC GAGCCTCGCG CGCCGACGCG AGGGCGAGGA CCTCACCGTC
GCAGTCAACT GCGAGACGCC CGACGCGACG AGCGGGAACG CGGTCGCCGA CCTCGGTCCC
GACACCGACG AGTACCTCGT CGTCTCCTGC CACGTCGACG CCCACGACCT CGCGGAGGGG
GCGATGGACA ACGGCGCCGG CACCGCGACG ATCGTCGAGG TCGCCAACGC TCTCGCGGCC
CGCGAAGAGG AGCTCGACAC GAGAGTGCGG TTCGTCGGCT TCGGTGCCGA AGAGGTCGGG
CTGGTCGGCT CTTCCCAATT TGCCGCGGGC GTCGACCCCG ACCACGTCAA GGCCGTCGTC
AACGTCGATA GCAACGTGTT CGGTCGTACC CTGAAGCTCG ATCACCACGG CTTCGACCCG
CTGGAGGCGG CCGGCGAGCG CGTGAGCGAC CGGTTCGATC ACCCGATCGC GCTCGGCGAG
GAGCAGGTCC CCCACAGCGA CCACTGGCCG TTCGTCGAGC GCGGGATCCC CGGCTATATG
GTCTCCGGTG AGACGGAGGG GCGCGGCCGG GGCTGGGGAC ACACGGGTGC GGACACGCTC
GACAAGCTGG AGTCTCGGAA CCTCCGCGAG CAGGCGATCC TCCTGACGGC GCTCGTCGTC
GACCTCGCCG GCGACGACGT GTCGACCGCG CGGAAGCCAA CCGACGAGAT CGCGAGCGCG
CTCGAACAAG AGGGGAAGGC GACGGGGATG AAGATAACCG GCGACTGGCC GTTCTAG
 
Protein sequence
MTDWIGDTFT SDAGWDHLES LVDIDHRMAG SDGERAGLEL TRDALSDAGA RDARIEEFEI 
QGWERGDSEI RRDEEVVASG QNACIALPRS PSGEATGEFV DLGYGVPEDF DDDLTGKVVM
VSSDTPDSVD RFIHRREKYY HAVEAGAAAF VFANHVEGTL PPTGSVGTAD APIGDIPAVG
VSKETGASLA RRREGEDLTV AVNCETPDAT SGNAVADLGP DTDEYLVVSC HVDAHDLAEG
AMDNGAGTAT IVEVANALAA REEELDTRVR FVGFGAEEVG LVGSSQFAAG VDPDHVKAVV
NVDSNVFGRT LKLDHHGFDP LEAAGERVSD RFDHPIALGE EQVPHSDHWP FVERGIPGYM
VSGETEGRGR GWGHTGADTL DKLESRNLRE QAILLTALVV DLAGDDVSTA RKPTDEIASA
LEQEGKATGM KITGDWPF