Gene Information |
Locus tag | Nmag_1698 |
Symbol | |
ID | 8824538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | - |
Start bp | 1730262 |
End bp | 1733261 |
Gene Length | 3000 bp |
Protein Length | 999 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | peptidase M14 carboxypeptidase A |
Protein accession | YP_003479836 |
Protein GI | 289581370 |
COG category | |
COG ID | |
TIGRFAM ID | |
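The coordinate and length fields above agree with one another, which a little arithmetic confirms. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a protein length that excludes the stop codon (both assumptions, although the listed values are consistent with them). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed for Nmag_1698.
length_bp = gene_length(1730262, 1733261)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3000 999
```

The strand does not enter this calculation: the length is the same whichever strand carries the gene.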
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACACC ATACAGACAA CGAATACAGG ACAATAGCCG ACGGGACCGA CGCACCGACT GAAACCGACG AGTCTGGTGT TCAAGCCACG TTTACCGACT CGACTGTCGA CCGACGAACG TTCCTCAGCC TCTCTGTTGC AACCGGCGCT GCGCTCGCAC TCCCCGGCAG CGCCACCGCC GACGTAAGTG ACGACGTGCT CAGCGACGAA CTCGAGTACG TCCTGAATCA CACGCCTGCG GAGTACGAGG CTGCGACCAG CATCGTCTTC ACCGATCAGG ACGTCTTCGA CGCGTTCGCG GACGAGTACG AGGAGGAGCC AGCCCCAGGC CGATCGCGCG CACCGAAGGC GGTCACTCGC GAGTCGCCAA CACTGTCTGC ACATGCACAC CTTACGGCGT CAGAGGTCGA GGACGTGCTC GCGCTGGGTG ATGGCGACGA CGATGATGGT ATCGACGCAA TGAACTTCGC ACCCGGTTCG AACCCCTGGT GGACGCTCGA GGAGCCCTAC GCGGACGGCG TCTTCCCGCC AATCGAGGAG GCTCGCGACT ACATCGCCTA CGAGGAGACG GTGCAGGCAC TCGAGTACAT CGAGGACACG CATTCGGACC GCGTTCGGGT GCAGTCGATC GGCGAGTCGC CAGGGTGGAC GAATCTGTAC ACCGACGAGG ACGCCGATCC GCGGGACGTC TACGTTGCGG AAGTCACGAA CGATGTGCAG GACGACTCGT CGTTCGCCGC GAAGGAGAAG GTCGTCTACT CGCTCAACAT TCACGGCGAC GAGCGGGCGG GAACGGAGGC TGGCTGTCGG CTGATCGAGG AAATCGCACG CGGCGAGGCG GACGATTTCG AGCACCTGCT CGACGATATC GTCTTGTTGT TCCTGTTTAC GAACCCGGAC GGCTGGGTTT CGCGCAAGCC ACAGACCGAG ATTCCGTGGG TTGCCGACCA CAATACCAAC TTCCAGCGTG GCAACGCGAG TACGTTCGAG GGGCAGCCAG TCGACACCAA CCGTCAGTAT CCGACGATGG GCTGGACGAA TCCGAGCTTC CGGCCAGCGG AGCCGGAGGG TGCGCCTGAG GAGTTCCACG ACCTCGTGCC TGACTCACTC GCCATCGTCG AGCACTTCCG CGAGTACGAC AACGTGGCGT TCCTCTGTGA CTACCACGGG ATGTACACCG CGGATCACAT GGTGTTCAAC CTCGAGACGA ACGCGCCGTT CGACCACGAC GGCACGCACG ACTTAGACGA GGTCAACATC CAGATCGGCG AGGGAATGCA GGAGTTCTGG GGAGACATCG ATGCGGTCGC AGATGACATC GCCACGGCCG GCGAGGAGAT GTACGGCTCG CCGTTCGTGC CGGACGGCGA CAGCTACGGC GGCCTGTTCG ACTGGGGAAC GATCTACGAC TCGCTGTCCT ACCAGATCAC TGGCGGCTTC CTCGGCTGGG CCGGCCAGCC AGCGGAGTTC GGCGGACTCG GTGCGATCAC CGTCGCCCCT GAAATCATCC TCTCGAACCA CTCTGTAGCG GCTCAGAAGG AGTGGATGCC CTACTGGACG CGCCACTACG AGGAGGCCTA CCGGATCTCG ATGCGCGAGT ACGCTGCGAT GACGGCCCGC GAGACGCACG CGACGGTCGA AACCGGTGGC CAGGACACTG CCTACGTCAC GACGGACGAA CTGACGCGCA CCTCAGCTGA GCTGCCACAC ACCGACGACC AGCCCGGTCG CGGTGGACCT GGCCGTGGTG GCCCCGGACA GGGCCGTGGG AGAGGACAGG ACCGTGCAAC GTCGGTCCAG 
CGCCGCCACG AGGTCGTCCA GCCGGGTCCC GGCACCCAGT CACAGCTCAG CGCCGAGACG ACCGCGGACT CTCACTCGCT GTTCGTCGAC CTCGAGGGCG TCGGCAACGC GACCGAGGGA ACCGTTCGCA TCCGAAACCC TGACGGAACC GTCGTCCACG AGATCGACAT CGACGCGAAG GCTGATCCGC GAAATCAGGC CGTGCGAACG CACGACTACG AGGAAATCTT CGTGCGCCGC CCCGAGGCCG GCCAGTGGAC TATCGAGGCG GAGAGCGACG CCGAACTCAA CGTCCTCACG ACGGTCGTCG ACGTGGCGGA CGACGAGGAG ATCCCGGACC CAGTGGAGGT GCTCGGCTAC GAGCAGCGCG AGTACTCGGT CAACCCAATG GAGTTCTTCG CGGATCTCGA CGATGACGTC GTCGACGGCG ATATGGACGG CATGAGCGTC CACCACGTGA GCGTCGGCCG CCTCCTCCGG GGGAACTCCG GCATGCGCCA CTACGACAAG GTCGTCGTCT CCCACGACGA CGGCATCGAC GACCCAGACT ACATCGGTGC ACTCGAGGAC TTCGTTGAGG CCGGCGGGGA TCTGATCCTC ACGGATTCGG GTGTCAACCT GCTCGCGGTG CTCGAGACCG GTGGGGCCAC AGCCATCACG GCCGACGACA TCGCGAACAT CCTCGTGCCG TTCGCCAATC TCGAGGATCG GGACTTCGAC CACCCGCTGC TGGCCGGCAT CAGAACCCGG CAGCAGGAGA TCTGGAAGGG GTCACAGCTC GGCTACACGA CTGGCGTCGA CCAGCCTGCG ACCATTGTCG ATGTGGACGC CTTCGAGACG GCTGGCGGTT CCGTCGCCGG GACGTTCACC ACGGCCGCGC TGCAGGACGG CGAATCGACC CAACTCGAGT CGGGTGTCGC TGCGGGGCTG CTTCCCGCAG CCGGCGATGG CGATGGGGCC GAAGGTGGAG ACGGAAACGG AAACGGAAAC GGAAACGGAA ACGGAAACGG AGATGCCGGC GAAATCGCCG TCGTCGGCTC GGTGCTCCCA CCGGCCCAGC AGACCGAACT CCATCCGTTC GGAATGGCGG ACTACGCGGT GTCGTTCATG GGTCACACCC TGCTGTGTAA CGCGCTCGGC TTCGAACAGC GTCGCTACGC TGATGGAGAA CTGGTGAGAA CGTACGGTGA GATACGGTAG
|
Protein sequence | MSHHTDNEYR TIADGTDAPT ETDESGVQAT FTDSTVDRRT FLSLSVATGA ALALPGSATA DVSDDVLSDE LEYVLNHTPA EYEAATSIVF TDQDVFDAFA DEYEEEPAPG RSRAPKAVTR ESPTLSAHAH LTASEVEDVL ALGDGDDDDG IDAMNFAPGS NPWWTLEEPY ADGVFPPIEE ARDYIAYEET VQALEYIEDT HSDRVRVQSI GESPGWTNLY TDEDADPRDV YVAEVTNDVQ DDSSFAAKEK VVYSLNIHGD ERAGTEAGCR LIEEIARGEA DDFEHLLDDI VLLFLFTNPD GWVSRKPQTE IPWVADHNTN FQRGNASTFE GQPVDTNRQY PTMGWTNPSF RPAEPEGAPE EFHDLVPDSL AIVEHFREYD NVAFLCDYHG MYTADHMVFN LETNAPFDHD GTHDLDEVNI QIGEGMQEFW GDIDAVADDI ATAGEEMYGS PFVPDGDSYG GLFDWGTIYD SLSYQITGGF LGWAGQPAEF GGLGAITVAP EIILSNHSVA AQKEWMPYWT RHYEEAYRIS MREYAAMTAR ETHATVETGG QDTAYVTTDE LTRTSAELPH TDDQPGRGGP GRGGPGQGRG RGQDRATSVQ RRHEVVQPGP GTQSQLSAET TADSHSLFVD LEGVGNATEG TVRIRNPDGT VVHEIDIDAK ADPRNQAVRT HDYEEIFVRR PEAGQWTIEA ESDAELNVLT TVVDVADDEE IPDPVEVLGY EQREYSVNPM EFFADLDDDV VDGDMDGMSV HHVSVGRLLR GNSGMRHYDK VVVSHDDGID DPDYIGALED FVEAGGDLIL TDSGVNLLAV LETGGATAIT ADDIANILVP FANLEDRDFD HPLLAGIRTR QQEIWKGSQL GYTTGVDQPA TIVDVDAFET AGGSVAGTFT TAALQDGEST QLESGVAAGL LPAAGDGDGA EGGDGNGNGN GNGNGNGDAG EIAVVGSVLP PAQQTELHPF GMADYAVSFM GHTLLCNALG FEQRRYADGE LVRTYGEIR
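Both sequences are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate the coding sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table uses the standard amino acid assignments, which translation table 11 shares; table 11 differs from the standard code only in which codons may serve as starts. Although the gene lies on the minus strand, the gene sequence as displayed begins with ATG and ends with TAG, so the sketch assumes it is already in coding orientation and does not reverse-complement it. Only the first two blocks of the gene sequence are used here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (the layout used by NCBI).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used for display."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


prefix = "ATGTCACACC ATACAGACAA"  # first two blocks of the gene sequence
print(translate(prefix))  # MSHHTD
```

The translated prefix matches the first six residues of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence should give a value comparable to the GC content field.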