Gene Hlac_0268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0268 
Symbol 
ID7401194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp291089 
End bp292255 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content73% 
IMG OID643707331 
Productpeptidase M24 
Protein accessionYP_002564943 
Protein GI222478706 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.798559 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGAAG ACGTCCACGC GGAACGGCGC GAGCGCGCGG CGGCGCGGCT CCGGGAGACC 
GGCGCCGACG GGCTCGTCTG TTTCCCCAGT CGGAACCTCC AGTACCTCAC CGGCTTCGCC
GAGGAGCCGG GCGAGCGACA CCTCCTGCTC GTCGTGCCGG CGGCCGATCG GAGCTCTGAC
ACCCCCGACG GTGGCGACCA GACCGCCGCC GAACCGACCC TCCTCGTGCC GGCCCTCCTC
GTGCCGGCCC TCTACGAGAC GCAGGTCAAG GAGGAGACCA CGGTCGGTGC GGTGCGGACG
TGGGCCGACG GCGACGACCC GACCGCCGCC GTCCAGGACC TCCTCGGCGA CCTCGGACTC
AGCGAAGGGC GGCTCCTCGT CGACGACACG ATGTGGGCGA CGTTCACGCA GGACCTCCGG
GCCGCCGCGC CCGACGCGGA GTGGGGACTC GCGAGCGAGG CGCTCGCCGA CCTCCGCGTG
CGGAAGGACG AGGCCGAGTT GGACGCGATG CGCGCCGCCG CGGCGGCCGC CGACGAGACG
GTCCGGGACC TCCGCGATCT CGGCGCGGAC GCGGTCGGGA TGACCGAACG CGACCTCACC
GACTGGATCG CGGACCGACT GGCCGCCCAC GGCGGCGAGG GAACCTCCTT CGAGACGATC
GTCGGGTCGG GGCCGAACGG GGCGAAGCCC CACCACGGCT GTGGCGACCG CGAGATCCGG
GCGGGCGAGC CGGTCGTACT CGACTTTGGC ACCCGAGTCG ACGGCTACCC CTCGGATCAG
ACGCGGACGC TCGTCTTCGA CGGCGAGCCG CCCGCCGAGT ACGAGCGTGT CCACGAGACC
GTCAGGGCGG CGCAGGCCGC CGCGGTCGAG GCGGTCGAAC CGGGCGTCGC CGCCGAGGCG
ATCGATCGGG CCGCCCGCGA TGTCATCGAG GACGCCGGGT ACGGCGACGC GTTCTTCCAC
CGCACCGGCC ACGGGGTCGG GCTCGACGTC CACGAGGAGC CGTACATCGT GGCCGGCAAC
GACCGGGAAC TGGAGCCGGG GATGGTGTTC TCGGTGGAGC CGGGGATCTA CCTCGACGGG
CGGTTCGGCT GTCGGATCGA GGACCTCGTC GTCGTCACCG AGGACGGGTG TGAGCGGCTG
AACGACACCG ACCGCGGCTG GCGGTGA
 
Protein sequence
MTEDVHAERR ERAAARLRET GADGLVCFPS RNLQYLTGFA EEPGERHLLL VVPAADRSSD 
TPDGGDQTAA EPTLLVPALL VPALYETQVK EETTVGAVRT WADGDDPTAA VQDLLGDLGL
SEGRLLVDDT MWATFTQDLR AAAPDAEWGL ASEALADLRV RKDEAELDAM RAAAAAADET
VRDLRDLGAD AVGMTERDLT DWIADRLAAH GGEGTSFETI VGSGPNGAKP HHGCGDREIR
AGEPVVLDFG TRVDGYPSDQ TRTLVFDGEP PAEYERVHET VRAAQAAAVE AVEPGVAAEA
IDRAARDVIE DAGYGDAFFH RTGHGVGLDV HEEPYIVAGN DRELEPGMVF SVEPGIYLDG
RFGCRIEDLV VVTEDGCERL NDTDRGWR