Gene Hlac_2666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2666 
Symbol 
ID7400872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2650630 
End bp2651751 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content67% 
IMG OID643709739 
Productpeptidase M29 aminopeptidase II 
Protein accessionYP_002567307 
Protein GI222481070 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2309] Leucyl aminopeptidase (aminopeptidase T) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGAAC GCGTACACGA GCACGCCGAA GTGCTCGTCG ATTGGAGCGC ACGCATTGAG 
GCCGGCGACG ACGTTGTCGT GAGCGTCGCC GAGGATGCCC ACGACCTCGG CGTCGCAGTC
GTCGAGGCGC TCGGCGAGCG GGGCGCGAAC GCCACGACAC TGTACGGGTC GGCGGAGATC
TCGCGGGCGT ATTTAAAAGG GAGTGAGCAG GGCAGTCACG GCTTCGACGA CGATCCGGCC
GTCGAGCGCG CGCTGTTCGA GGCCGCGGAC GCCTACCTCC GGATCGGCGG CGGCCGCAAC
ACCACCGCGA CCGCAGACGT GTCGAGCGAG ACGCGGCAGG CGTACGCGAA GGCGCGGAAG
GACGTGCGCG AGGCGCGGAT GGACACCGAC TGGGTGTCGA CGGTCCACCC CACGCGCTCG
CTCGCCCAGC AGGCCGGTAT GGCCTACGAG GAGTATCAAG AGTTCGTCTA CGACGCCGTC
CTCCGCGACT GGGAGGCACT TGCCGACGAG ATGGCGGCGA TGAAGGAGGC CCTCGACGCG
GGTGAGGAGG TCCGGATCGT CACCGATCGC GACGACGCCC CCGACACCGA TATTTCGATG
TCGATCGCGG GCCGGACCGC GGTCAACTCT GCCGCGTCGG TCGCGTACGA CTCACACAAT
CTCCCCTCCG GTGAGGTGTT CACCGCCCCC TACGACACCG AGGGCGAGGC GTTCTTCGAC
GTGCCGATGA CGATCGACGC GACCCGCGTT CGGGACGTGC ACCTCGTCTT CGAGGACGGC
GAGGTCGTCG ACTTCTCGGC GGGCGCCGGC GAGGACGCCC TCGCAAGCGT GCTCGACACC
GACCCCGGAG CCCGGCGACT CGGTGAACTC GGTATCGGGA TGAACCGCGG CATCGATCGG
TTCACCGACT CGATCCTCTT CGACGAGAAG ATGGGCGACA CGATCCACCT CGCGGTGGGA
CGCGCCTACG ACGCCTGCCT GCCGGAGGGC GAATCGGGCA ACGACAGCGC GGTCCACGTC
GACATGATCA GCGACGTGAG CGAGAATTCT CGAATGGAGA TCGACGGCGA GGTCGTTCAG
CGCAACGGTA CGTTCCGGTG GGAAGACGGG TTCGACGGCT GA
 
Protein sequence
MDERVHEHAE VLVDWSARIE AGDDVVVSVA EDAHDLGVAV VEALGERGAN ATTLYGSAEI 
SRAYLKGSEQ GSHGFDDDPA VERALFEAAD AYLRIGGGRN TTATADVSSE TRQAYAKARK
DVREARMDTD WVSTVHPTRS LAQQAGMAYE EYQEFVYDAV LRDWEALADE MAAMKEALDA
GEEVRIVTDR DDAPDTDISM SIAGRTAVNS AASVAYDSHN LPSGEVFTAP YDTEGEAFFD
VPMTIDATRV RDVHLVFEDG EVVDFSAGAG EDALASVLDT DPGARRLGEL GIGMNRGIDR
FTDSILFDEK MGDTIHLAVG RAYDACLPEG ESGNDSAVHV DMISDVSENS RMEIDGEVVQ
RNGTFRWEDG FDG