Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3947 |
Symbol | |
ID | 8744575 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 205276 |
End bp | 206916 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646514528 |
Product | N-acyl-D-glutamate deacylase |
Protein accession | YP_003405475 |
Protein GI | 284167197 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3653] N-acyl-D-aspartate/D-glutamate deacylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.291331 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTCTG TTAGCACGGG GTCCGTAGAG TTTCGAAACG CGCGCGTTCT CGACGGCACC GGCGGGCCGG TCTTTACCGC ACACGTTCTG GTCGATGGCG ATCGGATCGA ACGCGTCGGT GACGAACCGG CCGGCGCCGA CAGGGTTATC GATCTCGACG GCGCGTATCT CGCTCCGGGG TTCGTGGATA TGCACGCCCA TTCGGAATTG CGACTGATGA CCAAGCCGGC CGCCGAGGAG AAACTCACGC AGGGAATCAC CACCGAAGTC CTCGGACAGG ACGGTGTTAG CGTCGCTCCC GTGCCGGATG AACTGAAGAC GGAGTGGGAG AAGCGCATCC AGTCGCTGGA CGGAACGATC GATCGGCACT GGCCCTGGAA CTCCGTCTCC GAGTATCTCG ACGAACTCGA CGAGGCAGCG CCGGCCGTCA ACGCCGCCCA CTACGCCCCT CACGGGAACC TTCGGTCTCA CATCTCCGGC TTCGAGGACC GAGAACTCGC GACGGCGGAG ATCGCTGAGC TGCAAGAGCG GCTCGATCAG GCTATCGATG GCGGCGCGTT CGGGATGTCC ACGGGGATGA TCTATCCACC GAGTTCCTAC GGGCGCGACC CGGAGCTCGA GGCGCTCGCC GAGACGCTCG CGACTCGAGA CTCGTTTATG ATCTCTCACG TGTGGAACGA GACCGATCAC GTCGTCGAAT CCATCGACCG TTATCTCGGA ATCTGTCGTC GCGGCGGCTG TCACGCGCAC GTTTCACACC TGAAAGTCGG CGGCCGACGG AACTGGGGGC GTTCCGCGGA CGTCCTCGAT CTGTTCGACG ACGCGGTCGA CGCGGGACAG CGGGTCTCGT TCGATCAGTA CCCGTACACG GCGGGATCGA CGATGCTCAC CGCGTTGTTG CCGCCGTGGG CCCGTCGGAG AGAGACGTCG GACATCCTCG AACGGCTGCG GCGAGCGGAC GTCAGAGAGC GCCTCGCCGA GGATATCTCG TCCCCCGGCG ATTGGGAGAA CTTGGCCCGC GCGGCGGGCA CGTGGGACAA CATCCTCATC ACCCGGACCG CGAGCGGCGA CTATCAGGGG AACACGATCG AAGAGATCGC GACCGAACGA AACCTCGACC CGATCGATAC GCTGTGTGAA CTCCTCGTCG AGGAGAACCT GGACGTGACG ATGGCGGACT TCGTCATGGC GGAGGAGGAC ATCGAACGGT TCCTCGCGGA CGACCGGGGA ACGTTCTGCA CGGACGGCAT CTTCGGCGGG AAACCCCATC CGCGAGCCGT CGGGACCTTC GGCCGCATCC TCGAGCGGTA CGTCCGCGAG CGCGACGTGC TATCGCCGTC GCTGATGGCG CGCAAGGCCG CCGGTCACCC CGCCGATATC CTCGGATTAG CCGATCGAGG CTACGTGAAG GAGGGGTACG TCGCCGATCT CGTCGCGTTC GACCTCGACG CCGTGTCCGA GAACGCCACC TACGAGGACC CGATCCAGTA CACCGACGGC TTCGACTACG TACTCGTCGG CGGAGCGATC GCCGTCGAGG ACGGGGCGAC GACGGACGTC CGTAACGGCG AGATCCTGCG CTCGACCGAC GAGTGGGACG GCCGCTCGCG CCCGTCGCTC GACCGGTCGT CCGACGAGTA G
|
Protein sequence | MSSVSTGSVE FRNARVLDGT GGPVFTAHVL VDGDRIERVG DEPAGADRVI DLDGAYLAPG FVDMHAHSEL RLMTKPAAEE KLTQGITTEV LGQDGVSVAP VPDELKTEWE KRIQSLDGTI DRHWPWNSVS EYLDELDEAA PAVNAAHYAP HGNLRSHISG FEDRELATAE IAELQERLDQ AIDGGAFGMS TGMIYPPSSY GRDPELEALA ETLATRDSFM ISHVWNETDH VVESIDRYLG ICRRGGCHAH VSHLKVGGRR NWGRSADVLD LFDDAVDAGQ RVSFDQYPYT AGSTMLTALL PPWARRRETS DILERLRRAD VRERLAEDIS SPGDWENLAR AAGTWDNILI TRTASGDYQG NTIEEIATER NLDPIDTLCE LLVEENLDVT MADFVMAEED IERFLADDRG TFCTDGIFGG KPHPRAVGTF GRILERYVRE RDVLSPSLMA RKAAGHPADI LGLADRGYVK EGYVADLVAF DLDAVSENAT YEDPIQYTDG FDYVLVGGAI AVEDGATTDV RNGEILRSTD EWDGRSRPSL DRSSDE
|
| |