Gene Information |
Locus tag | Nmag_3295 |
Symbol | |
ID | 8826159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 3426004 |
End bp | 3428919 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | leucyl-tRNA synthetase |
Protein accession | YP_003481407 |
Protein GI | 289582941 |
COG category | |
COG ID | |
TIGRFAM ID | |
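
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes (this is not stated in the record itself) that the start and end positions are 1-based and inclusive, and that the CDS length includes a terminal stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Nmag_3295.
length_bp = gene_length(3426004, 3428919)
print(length_bp)                  # 2916
print(protein_length(length_bp))  # 971
```

Both results match the Gene Length and Protein Length rows, which supports the assumed conventions for this entry.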
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.668741 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAACC AGTACGATCA CGCGCAAGTA CAGGAGTTCT GGCAGTACGT CTGGGAACGC GACGGCGTCG CCGAACTCCC GGACGGGGCC GTCGATCCAA CCTACGTTCT CGGGATGTTC CCCTACACCT CCGGGACGCT CCACATGGGT CACGTCCGAA ATTACGCGAT CACCGACGCG TACGCCCGTT ACCGTCGGCT CCGCGGTGAC GACGTCCTGC ACCCGATGGG CTGGGACGCG TTCGGGCTCC CCGCGGAGAA CGCCGCCTAC GAACGGGCCA GCGACCCCGA ATCCTGGACC CGCGCGTGTA TCCGTCGAAT GCGCGAGGAA CTCGAGACGC TCGGCTTCGG CTACGACTGG TCACGAGAGA TTACGACCTG CGAACCGTCG TACTACCGCT GGAATCAGTG GCTGTTCAAG CGGTTCCACG AGGCCGGTCT CGTCGAATTC ACCGGCGCAA CGGTCAACTG GTGTCCGGAT TGCGAGACGG TGCTCGCCGA TGCGCAGGTT GCGGTTGACG AGGGTGGCCA AGCGGTGACA GCGACAGCGG ACGAAGCCGG CGACAACGGT GATAGTGCCG GCGGTGCACA CGACGAGAGT AACGGCAACG CACACGTACA TGAACACAGC ACCGCTCGCG TCTGCTGGCG CTGTGGCACC CCCGTCGAGC AGCGCGAACT CGATCAGTGG TTCTTCACGA TTACTGACTA CGCCGACGAA CTGGTCGACG GACTCGACGA CCTCGACCAG TGGCCGGAGG GCGTCCGCGA GATTCAGCGC AACTGGATCG GTCGACAGGA AGGGGCACGG CTCACGTTCG ACGTGTCGAC GGCAGCTGAC GAGTCGGACA CCGCCGTCGA TGTCTTCAGC ACCCGCTCCG AGACCGTCTT CGGCGCGACG TTCGTCGCCA TCTCGCCCGA ACACGACCTG GCGAGTGAAC TCGCGAACGC GGACGAGGAC GTAGCAACGT TCGTCGACCA GGCGAGAACG AGTGCACCAG ACACCGGCCA TGCCGCCCGT GACGACTTCA GCACCGCAGG CGTCAAAACC GACGCGACGG CAGAGAACCC GCACACGGGA GAGGAACTGC CGGTCTACGT CGCTGAGTAC GTCCTCGCGG ACGTGGGAAC GGGTGCGGTA ATGGGCGTTC CCGGACACAA CGAGCGCGAT CACGAGTTCG CTCGGGAGCA CGACCTGCCG GTCGAGACAG TCGTCGTTCC GCACGGCCAC AACGGCACAG GATCGGACGG TGGCGTCGCT ACAGATGCAC CCATGACCGG CGAGGGAACG CTGGTACTCG AGTCCCCTGC TGATCGCGCC AGCGAGTACG ACGGCCAACC GAGCGAGGAC GTTCGAGAGC ACCTCGTCGA CGACCACGAG GCGATCGACC CTGACGTGAC CTACCGACTC CGGGACTGGC TGATCTCACG CCAGCGCTAC TGGGGGACGC CAATCCCGGT CGTCCACTGC GAGGACTGCG GGCACGTCCT CGTCCCAGAC GAGGAGCTCC CGGTCGAGTT ACCGGAGTTC GTCCAGACCA CGGGGAATCC GCTCGACGCC GCCGAAGAGT GGAAAGAAAC GAGTTGCCCG GACTGTGGCG GCCCCGCAGA GCGCGAGACG GACACGATGG ACACCTTCGT CGACTCCTCG TGGTACTTCC TGCGATTCCT CTCGCCCGAC CTCGCGGACG CGCCGTTCGA CACGGAACTG GCGAACGAGT GGCTCCCCGT CGACGTCTAC GTCGGCGGCG ACGAACACGC CATCTTGCAC CTGCTGTACA TCCGGTTCGT GACGCGCGCG 
CTGGCGGATC TTGGCTTTCT CGACCAGCGC GAGCCCGTCG AACGACTCGT CAGCCAGGGG ACGGTACTCT ACGAGGGCGA GAAGATGTCC TCCTCGAGTG GGAACGTCGT TACGCCAGAT GAGTACGGCG CGGAGACGAC CCGACTGTTC GTCCTCTCGG CGGCCCACCC CGAACAGGAC TTCGAGTGGA CGGCAAACGA CGTGCGTGGT GCGTACGACC TCCAACAGGC GCTGTACAGT ATGGCGACCG AGTTCGTCGA CGAAGGCGAG ACCCGCGTCG AACGGGTGAG CCACGACGAG TTCGTCGACC GCGAAATCGA CCGCACGATC GTCGCCGCCC GCACTGAGTT CGAGCGCTTC CGGTTCCACC GCGTCGTCAC CGAGGTACAG GAACTCGCGG GACTCCTGCG CCAGTACCGC GGCTACGACC GCATCCATGG CGAGGTCTAC CGCCGCGGCC TGCTCACCAT CGCGGCGCTC ATCTCGCCGC TCGCGCCCCA CCTCGGCGAA GAGCTCTGGA ACAAGCTCCG TGGCGACGGT CTCGTCGTCG AAGCCGACTG GCCGGCACTC GAGTCCGACC CGGCGACGAT CGAGTCAGAC TACCAGCTTG AACGGCGGCT GGTGGAGACG ACGCGCGCTG ACGTGCGTGA TATTCTCGAC GTGGCGTCGA TCGATGCTCC GGACCAAATC GACCTCGTCG TGGCCGAGCC CTGGAAGTAC GAGGTTGCGA CGCGGCTCGC AGTGTCGGCT GGCGAACTGG ACGGTGACAG CGCGACCGGG ACCGTCGACA CAGCGGGTGC CGATACGATC GACGTCGGCG CGCTGGCCGA CGAGGTGGCT GTCGAGACGG ACGTACTCGC CGAGTTCGTC GCGGACCAGC GGCGTACTGA CGCTCAACAC TCGTCCTCGG AGGGACTCAC GGCGTCGCGC GAACAGACGC TTCTCGAGCA GGCGGCGTGG CTGCTCGCAG ACGAGTTCGA CGTGACTGTC AGCGTTCGCT CGGCGACGGC GGTTGGCACC GAAGACGAGA CAGCGGACGC TGCAGCGGAC GACGTCCCGG ACGCGGACGT CGCATCTCGT GCCCGGCCGG GGAAGCCGGC GATTCGGATT CAGTGA
|
Protein sequence | MTNQYDHAQV QEFWQYVWER DGVAELPDGA VDPTYVLGMF PYTSGTLHMG HVRNYAITDA YARYRRLRGD DVLHPMGWDA FGLPAENAAY ERASDPESWT RACIRRMREE LETLGFGYDW SREITTCEPS YYRWNQWLFK RFHEAGLVEF TGATVNWCPD CETVLADAQV AVDEGGQAVT ATADEAGDNG DSAGGAHDES NGNAHVHEHS TARVCWRCGT PVEQRELDQW FFTITDYADE LVDGLDDLDQ WPEGVREIQR NWIGRQEGAR LTFDVSTAAD ESDTAVDVFS TRSETVFGAT FVAISPEHDL ASELANADED VATFVDQART SAPDTGHAAR DDFSTAGVKT DATAENPHTG EELPVYVAEY VLADVGTGAV MGVPGHNERD HEFAREHDLP VETVVVPHGH NGTGSDGGVA TDAPMTGEGT LVLESPADRA SEYDGQPSED VREHLVDDHE AIDPDVTYRL RDWLISRQRY WGTPIPVVHC EDCGHVLVPD EELPVELPEF VQTTGNPLDA AEEWKETSCP DCGGPAERET DTMDTFVDSS WYFLRFLSPD LADAPFDTEL ANEWLPVDVY VGGDEHAILH LLYIRFVTRA LADLGFLDQR EPVERLVSQG TVLYEGEKMS SSSGNVVTPD EYGAETTRLF VLSAAHPEQD FEWTANDVRG AYDLQQALYS MATEFVDEGE TRVERVSHDE FVDREIDRTI VAARTEFERF RFHRVVTEVQ ELAGLLRQYR GYDRIHGEVY RRGLLTIAAL ISPLAPHLGE ELWNKLRGDG LVVEADWPAL ESDPATIESD YQLERRLVET TRADVRDILD VASIDAPDQI DLVVAEPWKY EVATRLAVSA GELDGDSATG TVDTAGADTI DVGALADEVA VETDVLAEFV ADQRRTDAQH SSSEGLTASR EQTLLEQAAW LLADEFDVTV SVRSATAVGT EDETADAAAD DVPDADVASR ARPGKPAIRI Q
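
The gene and protein sequences above are related through translation table 11, and the GC content row is a property of the gene sequence. The following is a minimal sketch of both calculations in plain Python, not the method used by the database. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, the codon table is built from the compact NCBI amino-acid string for table 11 (codons ordered over T, C, A, G), and alternative start codons are not handled because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence, copied from the record.
prefix = "ATGACAAACC AGTACGATCA CGCGCAAGTA"
print(translate(prefix))  # MTNQYDHAQV
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and multiplying by 100 would give a value to compare against the GC content row.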