Gene Hlac_3448 details


Gene Information

Locus tag: Hlac_3448
Symbol:
ID: 7402294
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012030
Strand:
Start bp: 197906
End bp: 199165
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 67%
IMG OID: 643709989
Product: peptidase M20
Protein accession: YP_002567555
Protein GI: 222481319
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID:
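
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record or of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(197906, 199165)
print(length, protein_length(length))  # 1260 419
```

Both values match the Gene Length and Protein Length fields listed for Hlac_3448.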


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCACA CACGGACAGA CGTTCAGCGG ACCGACGACC GCCGAAATCG GCTCGCCGAG 
ACGACGCTCG AGTTGCTCGC GTTCGACACA CAGAACCCAC CCGGCGAGAC CCGACAGGCG
TTCGACTGGC TCGAGCGCTC CGTCCCGGAA CGTGGTGTCG AGATCGATCG GATAGAAGCC
GAACGCGAGA AACCGAACCT CGTCGTGACC ATCCCCGGCG AGCGCGAGTG GACGCTGCTC
TACGAGGGCC ACCTCGATAC CGTCCCCTAC GACCGGGACT GCTGGTCGCA CGATCCACTG
GGCGATCGCG TCGACGACCG GCTCTACGGC CGCGGTGCGA CCGACATGAA GGGTGCGGTC
GCAGCGATGC TCGAAACGAT GCGGACGTTC GCCGACGAGA CGCCGCCGGT GACCCTGCAG
TTCGCGTTCG TCAGCGACGA GGAGACCGGT GGGGGCGCGG GAATCGACGC CGTGCTGGAC
GCCGAGGCGA TCAGCGCCGA CGCCGCAGTG GTCGGCGAGA CGACCTGCGT CGACGAACGC
CACTCGATCG CTGTCGCCGA CAAGGGTCGA ATCTGGCTCA CGCTCGAGGC GACCGGGCGG
GCCGCCCACG GCTCCCGGCC GATGAACGGC GAGAACGCGA TCGATTACCT CTACTCGATG
ATCGATTCCT GTCGGGAATC GATTACGTCC CGTCGGCTGG AGTACGATCC GGCGGTCGAA
CGGATCCTCG AGGAGTCTCG GGCATACTAC GGGTCTTGTC CGTGCGAGGC TGGGACACAC
CTCGAAGAGC TCTTCGAGTA CCCCACGTTC AACCTCGGGC GTCTGGACGG CGGCAACACC
GTCAACAGCG TCCCCCAGAC TGCGACCGGC GAACTCGACG TTCGGGTGAC GCCGGGAGCC
TCTACCGGGG CGGTTCTGGA GCAGATCCGG ACGTGTATCG ACGGCCGGGA GCACGTCTCG
ATTCGGGACG TCTCCTGGGC CGAGGGAACC TACGTCGAAC CGTCCGCTCC GATCGTCGAG
GCCGTCACCA CGGCGGCCGC GGACGTCCTC ACGGATCGGC CGCTTGCCCG CTGTGCGACC
GGTGGTGGCG ACGTCAAGAA GCTCCGGGCG GCGGACGTTC CCGCAGTCGA GTGTGCAATC
GGGAGCGATA CCGCCCACGG TGTCGACGAG TACGTCCCGA TCGACGCGCT CGAACGCACG
GCTAAGTGGT ACGTGCGGCT ACCGGGCCAG CTCGCCGAGT CGATCGGGTC CAAGCGCTAG
 
Protein sequence
MNHTRTDVQR TDDRRNRLAE TTLELLAFDT QNPPGETRQA FDWLERSVPE RGVEIDRIEA 
EREKPNLVVT IPGEREWTLL YEGHLDTVPY DRDCWSHDPL GDRVDDRLYG RGATDMKGAV
AAMLETMRTF ADETPPVTLQ FAFVSDEETG GGAGIDAVLD AEAISADAAV VGETTCVDER
HSIAVADKGR IWLTLEATGR AAHGSRPMNG ENAIDYLYSM IDSCRESITS RRLEYDPAVE
RILEESRAYY GSCPCEAGTH LEELFEYPTF NLGRLDGGNT VNSVPQTATG ELDVRVTPGA
STGAVLEQIR TCIDGREHVS IRDVSWAEGT YVEPSAPIVE AVTTAAADVL TDRPLARCAT
GGGDVKKLRA ADVPAVECAI GSDTAHGVDE YVPIDALERT AKWYVRLPGQ LAESIGSKR
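
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that clean-up plus a GC-content calculation. The helper names `strip_blocks` and `gc_percent` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def strip_blocks(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace; upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence only (illustrative input).
fragment = strip_blocks("ATGAACCACA CACGGACAGA")
print(fragment)              # ATGAACCACACACGGACAGA
print(gc_percent(fragment))  # 50.0
```

Applying the same two helpers to the complete gene sequence would give a value that can be compared with the GC content field in the Gene Information section.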