Gene Information |
Locus tag | Nmag_4015 |
Symbol | |
ID | 8828749 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013924 |
Strand | + |
Start bp | 54975 |
End bp | 56453 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | thymidine phosphorylase |
Protein accession | YP_003482108 |
Protein GI | 289937506 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
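The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record for Nmag_4015.
start_bp, end_bp = 54975, 56453

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1479, matching "Gene Length"
print(protein_length_aa(length))  # 492, matching "Protein Length"
```

Under these assumptions, 1479 bp corresponds to 493 codons: 492 residues plus one stop codon.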
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACTCG AGGCCACCAC GATCGACATC GGCACATCTC GTCCGCTGGT CCTGCTGAAC ACGTTCGACG CGGACGAACT GGGTGCACAT CCGCTCGACC GCGTCCGGAT CGGGTGGGAG GGGGAGACGA CCACCGGGAT CGTCAAGCGG ACCGACGAAC TGGTCGAGCC GGGCATCGTC GGCGTGACCG AACCCCTCCA CCACGTACAC GGTGACGTCG GGATCACGCT CGCGGGGACG CCGCCGTCGG TCCGGTGCGT TCGGAAGAAA CTGGACGACG TCGAACTTGG GCGGGCGGAA CTCGAGCGGA TCGTCCGGGA CGTCCACGAA CACCGGCTCT CGGACGTCGA GTTGAGCGCG TTCGTCTCCG CGGTCTACGC GAACGGGCTC TCGCTGGCCG AGACGAAGCA CCTGACCGAG GCGATGAGCG GCGTTGGCCA GCGACTGCAG TGGGATCGCC CGGTCGTCGC GGACAAACAC TCCATCGGGG GCGTGGCGGG CAACTGCGTG ACGCCGATCA TGGTCGCGAT CGTCACCGAA GCCGGCGTGA CGATGCCCAA GACCTCCTCG CGGGCGGTAA CCTCGCCGGC CGGCACTGCG GACGTGATGG AGGTCCTCAG TGATGTCGAG TTCTCGATCG ACGAGATCGA ACGGATAGTC ACCGACACCA ACGGCTGTCT CGTCTGGGGT GGCGGTGTTG ATCTCTCGCC CGTCGACGAC GAGATCATCC GTGCGGAGAA CCCGCTATCG ATCGATCCCG AGGGACTGCT GATGGCGTCA GTGCTCTCGA AAAAACAGAG CGCCGGCTCG ACCAACGTCG TGATCGATAT CCCCTACGGC GAGGGGGCGA AAGTCGAGAG CCTGGTTGCT GCCCGCGAAC TCGCGGACGA TTTCAAGCGC GTCGGCGACC ACCTGGAGAT GGATGTCAGC TGTGCGATCA CCCACGGGAC CGATCCGATC GGCCGTGGCA TCGGTCCCGT CCTCGAGGCC CGTGACGTCC TCGCGGTGCT CGAGGGTAAC GGCCCGGATT CGCTCCGGCT GAAGTCGTTG CGGCTCGCCG AGATGTTACT CGAGCACTGT GGTGTCGACG CCTCTGCGAC CGAAATTCTC GACTCCGGCA AGGCACTCGA GCAGTTCCGG ACGATCGTTG CCGCACAGGG GGGCGATCCG GACGTCGAAT CGACGGATCT CGAGCCCGGC GAGGAGTCGA CGACCGTCCG GGCAGATCGG GCCGGGATCA CGAGTCGCGT CGACAACAGG CAGCTGTCTG ACCTCGCACG GCGTGCCGGC GCACCGCGTG ACAGCGGGGC CGGACTCGTC GTCCACCGAA CGGTTGGCGA CGAGGTCGAA CTCGGGGATC GGCTGTACAC GATCCACGCC GAAACCGAGT CCAAACTCGA GGAGGCAGTG TCGTTCGCCG ACCAACTCGA GCCGATCAGG GTTCGGAGCA AGGCCGACGC ATTAATCGAA CGACGGTAG
|
Protein sequence | MRLEATTIDI GTSRPLVLLN TFDADELGAH PLDRVRIGWE GETTTGIVKR TDELVEPGIV GVTEPLHHVH GDVGITLAGT PPSVRCVRKK LDDVELGRAE LERIVRDVHE HRLSDVELSA FVSAVYANGL SLAETKHLTE AMSGVGQRLQ WDRPVVADKH SIGGVAGNCV TPIMVAIVTE AGVTMPKTSS RAVTSPAGTA DVMEVLSDVE FSIDEIERIV TDTNGCLVWG GGVDLSPVDD EIIRAENPLS IDPEGLLMAS VLSKKQSAGS TNVVIDIPYG EGAKVESLVA ARELADDFKR VGDHLEMDVS CAITHGTDPI GRGIGPVLEA RDVLAVLEGN GPDSLRLKSL RLAEMLLEHC GVDASATEIL DSGKALEQFR TIVAAQGGDP DVESTDLEPG EESTTVRADR AGITSRVDNR QLSDLARRAG APRDSGAGLV VHRTVGDEVE LGDRLYTIHA ETESKLEEAV SFADQLEPIR VRSKADALIE RR
|
| |
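The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last few codons. This is a minimal sketch, not the pipeline used to produce the record: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses
# the same codon-to-amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    # Drop the spaces that separate the 10-base blocks in the record.
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First three 10-base blocks of the gene sequence in the record.
print(translate("ATGCGACTCG AGGCCACCAC GATCGACATC"))  # MRLEATTIDI

# Final nine bases of the gene sequence: two arginines, then the TAG stop.
print(translate("CGACGGTAG"))  # RR*
```

The first call reproduces the first ten residues of the listed protein sequence, and the second shows that the gene ends in a TAG stop codon that is not included in the protein.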