Gene Hlac_2664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2664 
Symbol 
ID7400870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2648497 
End bp2649825 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content72% 
IMG OID643709737 
Productpeptidase M20 
Protein accessionYP_002567305 
Protein GI222481068 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTTCG ACATGCGCGC GTTCGCCGCC GACCTCTGCC GGTTCGCCTC GACGGCCGGC 
GAGGAGGCGG CCGCCGCCGA TTTCGTCGAG GGCGAACTCG ACGCGCTGGG GTTCGAGACG
TACGCGTGGG ACGCCGACCC GGAACTCCTC GCCGATCACC CCTCCTTCCC GGACGACATC
GACGAGTCGG ACGTGGCGGG CCGACGAAGC GTCGCGGGCG TGCTCGCGCT CGGCGGGAGC
AACAACGGCA ACCCGAGAGA CGACGCCCCC ACGATCGTTG TAAACGGCCA CATCGACGTG
GTGCCGGCCG AGCCCGCCGA GTGGTCGAGT GACCCGTTCG AGCCGGTGTG GGGGGAGAGC
GGGAACAAGA GGGAGGGCAA CGACACCGTC GAGACCCTCA CCGCCCGCGG CGCCGCAGAC
ATGAAGTCGG GGGGCGCGGC GTGCATCGGC GCGGCGCTCG ACGTTCGAGA GGCGGTCGCC
GCCGGTGCCG TTGATCTCCC GGACGCGGGG CTCCGGATCG TCGTCGAGGC GGTCGCCGGC
GAGGAGGACG GCGGGTACGG AGCCGCGACC GCGGCGCTCG CGAATCCCTA TCCCTTCGAT
CGCGACGCCG CGATCGTCGC AGAGCCGACG GAGCTTCGTC CCGTGGTCGC CTGCGAGGGG
TCGCTGATGG CGCGGCTGGA ACTGGTCGGG CGGAGCGCCC ACGCCGCCAC GCGCTGGCGG
GGCGAGGACG TGCTCCCGCG GTTCGAAGCG ATCCGCGAGG CGTTCGCCGA GCTGGAAACG
GAGCGCGGCG AGACGGTCGA TCATCCCCTG TACCGCGAGT TCCCGGTACC GTGGCCCGTC
GTCTGCGGGA CGGTGGAGGC CGGCTCGTGG GCGTCGACGG TGCCGGCGAC GCTGACCGCG
GAGTTCCGGA TCGGCGTCGC TCCGGGCGAA ACGGTCGACG AGGTGGAAGA GACGTTCCGC
GCGCGGTTGG ACGACGTGGT CGCGGACGAC CCGTGGCTCC GCGAACACCC GCCGACGTTC
GAGCGGTTCT CGGTGCAGTT CGAGCCCGCC GAAATCGCCG TCGACGAGCC AATTGTCGAG
GCGGCTCGGG CGGGAATTGT CGAAGCGGGG CTTCCGGACG CGGAGCCGAC GGGCGCGACA
TACGGCGCAG ACTCCCGGCA CTACATCGCG GCGGGGATCC CGACGGTCCT GCTCGGGCCC
GGAAGCATCA CGGAGGCGCA CTATCCGGAC GAGACGATCG CGTGGGACGA GGTCGAGCGC
GGGCGAGAAG CGATCGCGGC TGCGGTCGGC CGGTTCGCGG CGGGCTACGC CGCGTCAGAC
TCGAAGTAG
 
Protein sequence
MNFDMRAFAA DLCRFASTAG EEAAAADFVE GELDALGFET YAWDADPELL ADHPSFPDDI 
DESDVAGRRS VAGVLALGGS NNGNPRDDAP TIVVNGHIDV VPAEPAEWSS DPFEPVWGES
GNKREGNDTV ETLTARGAAD MKSGGAACIG AALDVREAVA AGAVDLPDAG LRIVVEAVAG
EEDGGYGAAT AALANPYPFD RDAAIVAEPT ELRPVVACEG SLMARLELVG RSAHAATRWR
GEDVLPRFEA IREAFAELET ERGETVDHPL YREFPVPWPV VCGTVEAGSW ASTVPATLTA
EFRIGVAPGE TVDEVEETFR ARLDDVVADD PWLREHPPTF ERFSVQFEPA EIAVDEPIVE
AARAGIVEAG LPDAEPTGAT YGADSRHYIA AGIPTVLLGP GSITEAHYPD ETIAWDEVER
GREAIAAAVG RFAAGYAASD SK