Gene Nmag_1698 details


Gene Information

Locus tag: Nmag_1698
Symbol:
ID: 8824538
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 1730262
End bp: 1733261
Gene Length: 3000 bp
Protein Length: 999 aa
Translation table: 11
GC content: 64%
IMG OID:
Product: peptidase M14 carboxypeptidase A
Protein accession: YP_003479836
Protein GI: 289581370
COG category:
COG ID:
TIGRFAM ID:
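
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon, if counted in the gene length, encodes no amino acid.
    return codons - 1 if includes_stop else codons


length = gene_length(1730262, 1733261)
print(length, protein_length(length))
```

Under these assumptions the values agree with the record: 3000 bp and 999 aa.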


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCACACC ATACAGACAA CGAATACAGG ACAATAGCCG ACGGGACCGA CGCACCGACT 
GAAACCGACG AGTCTGGTGT TCAAGCCACG TTTACCGACT CGACTGTCGA CCGACGAACG
TTCCTCAGCC TCTCTGTTGC AACCGGCGCT GCGCTCGCAC TCCCCGGCAG CGCCACCGCC
GACGTAAGTG ACGACGTGCT CAGCGACGAA CTCGAGTACG TCCTGAATCA CACGCCTGCG
GAGTACGAGG CTGCGACCAG CATCGTCTTC ACCGATCAGG ACGTCTTCGA CGCGTTCGCG
GACGAGTACG AGGAGGAGCC AGCCCCAGGC CGATCGCGCG CACCGAAGGC GGTCACTCGC
GAGTCGCCAA CACTGTCTGC ACATGCACAC CTTACGGCGT CAGAGGTCGA GGACGTGCTC
GCGCTGGGTG ATGGCGACGA CGATGATGGT ATCGACGCAA TGAACTTCGC ACCCGGTTCG
AACCCCTGGT GGACGCTCGA GGAGCCCTAC GCGGACGGCG TCTTCCCGCC AATCGAGGAG
GCTCGCGACT ACATCGCCTA CGAGGAGACG GTGCAGGCAC TCGAGTACAT CGAGGACACG
CATTCGGACC GCGTTCGGGT GCAGTCGATC GGCGAGTCGC CAGGGTGGAC GAATCTGTAC
ACCGACGAGG ACGCCGATCC GCGGGACGTC TACGTTGCGG AAGTCACGAA CGATGTGCAG
GACGACTCGT CGTTCGCCGC GAAGGAGAAG GTCGTCTACT CGCTCAACAT TCACGGCGAC
GAGCGGGCGG GAACGGAGGC TGGCTGTCGG CTGATCGAGG AAATCGCACG CGGCGAGGCG
GACGATTTCG AGCACCTGCT CGACGATATC GTCTTGTTGT TCCTGTTTAC GAACCCGGAC
GGCTGGGTTT CGCGCAAGCC ACAGACCGAG ATTCCGTGGG TTGCCGACCA CAATACCAAC
TTCCAGCGTG GCAACGCGAG TACGTTCGAG GGGCAGCCAG TCGACACCAA CCGTCAGTAT
CCGACGATGG GCTGGACGAA TCCGAGCTTC CGGCCAGCGG AGCCGGAGGG TGCGCCTGAG
GAGTTCCACG ACCTCGTGCC TGACTCACTC GCCATCGTCG AGCACTTCCG CGAGTACGAC
AACGTGGCGT TCCTCTGTGA CTACCACGGG ATGTACACCG CGGATCACAT GGTGTTCAAC
CTCGAGACGA ACGCGCCGTT CGACCACGAC GGCACGCACG ACTTAGACGA GGTCAACATC
CAGATCGGCG AGGGAATGCA GGAGTTCTGG GGAGACATCG ATGCGGTCGC AGATGACATC
GCCACGGCCG GCGAGGAGAT GTACGGCTCG CCGTTCGTGC CGGACGGCGA CAGCTACGGC
GGCCTGTTCG ACTGGGGAAC GATCTACGAC TCGCTGTCCT ACCAGATCAC TGGCGGCTTC
CTCGGCTGGG CCGGCCAGCC AGCGGAGTTC GGCGGACTCG GTGCGATCAC CGTCGCCCCT
GAAATCATCC TCTCGAACCA CTCTGTAGCG GCTCAGAAGG AGTGGATGCC CTACTGGACG
CGCCACTACG AGGAGGCCTA CCGGATCTCG ATGCGCGAGT ACGCTGCGAT GACGGCCCGC
GAGACGCACG CGACGGTCGA AACCGGTGGC CAGGACACTG CCTACGTCAC GACGGACGAA
CTGACGCGCA CCTCAGCTGA GCTGCCACAC ACCGACGACC AGCCCGGTCG CGGTGGACCT
GGCCGTGGTG GCCCCGGACA GGGCCGTGGG AGAGGACAGG ACCGTGCAAC GTCGGTCCAG
CGCCGCCACG AGGTCGTCCA GCCGGGTCCC GGCACCCAGT CACAGCTCAG CGCCGAGACG
ACCGCGGACT CTCACTCGCT GTTCGTCGAC CTCGAGGGCG TCGGCAACGC GACCGAGGGA
ACCGTTCGCA TCCGAAACCC TGACGGAACC GTCGTCCACG AGATCGACAT CGACGCGAAG
GCTGATCCGC GAAATCAGGC CGTGCGAACG CACGACTACG AGGAAATCTT CGTGCGCCGC
CCCGAGGCCG GCCAGTGGAC TATCGAGGCG GAGAGCGACG CCGAACTCAA CGTCCTCACG
ACGGTCGTCG ACGTGGCGGA CGACGAGGAG ATCCCGGACC CAGTGGAGGT GCTCGGCTAC
GAGCAGCGCG AGTACTCGGT CAACCCAATG GAGTTCTTCG CGGATCTCGA CGATGACGTC
GTCGACGGCG ATATGGACGG CATGAGCGTC CACCACGTGA GCGTCGGCCG CCTCCTCCGG
GGGAACTCCG GCATGCGCCA CTACGACAAG GTCGTCGTCT CCCACGACGA CGGCATCGAC
GACCCAGACT ACATCGGTGC ACTCGAGGAC TTCGTTGAGG CCGGCGGGGA TCTGATCCTC
ACGGATTCGG GTGTCAACCT GCTCGCGGTG CTCGAGACCG GTGGGGCCAC AGCCATCACG
GCCGACGACA TCGCGAACAT CCTCGTGCCG TTCGCCAATC TCGAGGATCG GGACTTCGAC
CACCCGCTGC TGGCCGGCAT CAGAACCCGG CAGCAGGAGA TCTGGAAGGG GTCACAGCTC
GGCTACACGA CTGGCGTCGA CCAGCCTGCG ACCATTGTCG ATGTGGACGC CTTCGAGACG
GCTGGCGGTT CCGTCGCCGG GACGTTCACC ACGGCCGCGC TGCAGGACGG CGAATCGACC
CAACTCGAGT CGGGTGTCGC TGCGGGGCTG CTTCCCGCAG CCGGCGATGG CGATGGGGCC
GAAGGTGGAG ACGGAAACGG AAACGGAAAC GGAAACGGAA ACGGAAACGG AGATGCCGGC
GAAATCGCCG TCGTCGGCTC GGTGCTCCCA CCGGCCCAGC AGACCGAACT CCATCCGTTC
GGAATGGCGG ACTACGCGGT GTCGTTCATG GGTCACACCC TGCTGTGTAA CGCGCTCGGC
TTCGAACAGC GTCGCTACGC TGATGGAGAA CTGGTGAGAA CGTACGGTGA GATACGGTAG
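
The gene sequence above is printed in blocks of 10 bases, 60 per line, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that cleanup and of a GC-content calculation like the one behind the "GC content" field. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTCACACC ATACAGACAA CGAATACAGG ACAATAGCCG ACGGGACCGA CGCACCGACT"
seq = clean_sequence(first_line)
print(len(seq), f"{100 * gc_content(seq):.1f}% GC")
```

Running the same two functions on all 50 sequence lines would give the figure for the whole gene, which the record reports as 64%.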
 
Protein sequence
MSHHTDNEYR TIADGTDAPT ETDESGVQAT FTDSTVDRRT FLSLSVATGA ALALPGSATA 
DVSDDVLSDE LEYVLNHTPA EYEAATSIVF TDQDVFDAFA DEYEEEPAPG RSRAPKAVTR
ESPTLSAHAH LTASEVEDVL ALGDGDDDDG IDAMNFAPGS NPWWTLEEPY ADGVFPPIEE
ARDYIAYEET VQALEYIEDT HSDRVRVQSI GESPGWTNLY TDEDADPRDV YVAEVTNDVQ
DDSSFAAKEK VVYSLNIHGD ERAGTEAGCR LIEEIARGEA DDFEHLLDDI VLLFLFTNPD
GWVSRKPQTE IPWVADHNTN FQRGNASTFE GQPVDTNRQY PTMGWTNPSF RPAEPEGAPE
EFHDLVPDSL AIVEHFREYD NVAFLCDYHG MYTADHMVFN LETNAPFDHD GTHDLDEVNI
QIGEGMQEFW GDIDAVADDI ATAGEEMYGS PFVPDGDSYG GLFDWGTIYD SLSYQITGGF
LGWAGQPAEF GGLGAITVAP EIILSNHSVA AQKEWMPYWT RHYEEAYRIS MREYAAMTAR
ETHATVETGG QDTAYVTTDE LTRTSAELPH TDDQPGRGGP GRGGPGQGRG RGQDRATSVQ
RRHEVVQPGP GTQSQLSAET TADSHSLFVD LEGVGNATEG TVRIRNPDGT VVHEIDIDAK
ADPRNQAVRT HDYEEIFVRR PEAGQWTIEA ESDAELNVLT TVVDVADDEE IPDPVEVLGY
EQREYSVNPM EFFADLDDDV VDGDMDGMSV HHVSVGRLLR GNSGMRHYDK VVVSHDDGID
DPDYIGALED FVEAGGDLIL TDSGVNLLAV LETGGATAIT ADDIANILVP FANLEDRDFD
HPLLAGIRTR QQEIWKGSQL GYTTGVDQPA TIVDVDAFET AGGSVAGTFT TAALQDGEST
QLESGVAAGL LPAAGDGDGA EGGDGNGNGN GNGNGNGDAG EIAVVGSVLP PAQQTELHPF
GMADYAVSFM GHTLLCNALG FEQRRYADGE LVRTYGEIR
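
The protein sequence can be checked against the gene sequence by translating codon by codon. The following is a minimal sketch, not a replacement for a sequence-analysis library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a DNA string (spaces and line breaks allowed) up to the first stop.

    Raises KeyError if a codon contains a character other than A, C, G, or T.
    """
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGTCACACC ATACAGACAA CGAATACAGG"))  # MSHHTDNEYR
```

The output matches the first block of the protein sequence. Translating the last 12 bases of the gene (GAGATACGGTAG) in the same way gives EIR followed by a stop codon, which matches the end of the protein sequence.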