Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_4083 |
Symbol | |
ID | 8828817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013924 |
Strand | - |
Start bp | 125535 |
End bp | 126953 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | Carotenoid oxygenase |
Protein accession | YP_003482170 |
Protein GI | 289937568 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGAACG CAACCGATCG CGCCTACGAA CTCGGGTTTC GAACGGTCGA CACCGAGTAC GCCGACCGGC AGCTACCGGT TGAGGGGACC GTGCCGTCGT GGCTCTCCGG GGCGTTGATC CGGAACGGCC CGGGCCGGTT CGAGTTCGGC GGCAAGCGTG CGACCCACTG GTTCGACGGT CTGGCGATGC TCCGCCGCTA CGGGTTCGCG GACGGCACCG TTTCGTATAC AAACCGGTTC CTACGGACCG ACGCGTACGC GGCCGCCGAC ACCGGCCACG GCGCGGCGGA GTTCGCAACG GGCGACGATT CGTTCCGCCG GCCCCTGCGG TGGCTCCGGT CGCTCGGACC GCCCGAACCG ACGGACAACG CGACCGTCCA CGTCGCCCAA CTCGGCGAGC ACTTCGTCGC GCTCACCGAG GCACCGCGGC GGATCGCGTT CGATCCGGTG ACACTCGAGA CCCGCGGCGA ATTTCGCTGG CGCGACGACA TTCCGGAGCA TCTGGCGACA GCCCACCTCC AGGTCGATCC CAATCGCGAG GAAACGATCG GCTACAGCAC GGAGTTCGGT CTCTCTCCGA TGTATCACTT CTACCGGATC CCCAACGGAC GTGCCGGCCG ACGGCACGTC GCTACCGTGC CGGCCGCTGG CCCCGGGTAC GTCCACGACT GTTCGATCAC CGAGTCACAT ATCGTCATCG TGGAGACGCC GCTCCGGATC GCGATGGCGA AGGCACTGGT CCCGTGGACC GACGGATTTC TCGACCTTCT CGAGTACGAC GAGGCGGCCA CGACCAGGTT CATCGTCGTC GATTGGGACA CGGAGAGCCT CGCCGCGACG CTCGAAACGT CGCCGTTCTT CACGTTCCAC CACGTCAACG CCTACGAGGA CGACGACGAG TTGGTTCTCG ACCTCGTCGC GTTCGACGAC GACCAGATCG TCCGAGCGCT CACCTTCGAT GCCCTCTCGG AGGACGGCTT TGCCGCGGCG CCGGACGGTC GGTTTGTGAG ATTCCGGCTC CATCCCGGCG AGGGACGCGT CCGACGCTCA GAACGATACG ACGGCGGGAT GGAGCTCCCG ACAGTTCCCA AACCGGTTCG GGGCCGACAG TACCGGTACG CGTATGCACA GGCGACCGAC AGGAAGGGTG CAAACGGACT GGTCAAACTC GATGTCGAAC GGGGAACCGC GACGGAGTGG TGGGAGCGCG GCGTCTACGT CGAAGAGCCG CGGATGGTTC GACGACCGGG CGGGACGGCC GAGGACGACG GCGTCGTGAT CGCGACGGCG CTGGACACCA AACAGGAGCG ATCGATGCTG CTCGTATTCG ATGCGGAAAC GGTAGTCGAG AGAGCGCGTG CACCTCTGCC ACACGCCGTT CCGTTCGGCT TCCACGGTCG GTTCTTTCCG GCGGTGTGA
|
Protein sequence | MSNATDRAYE LGFRTVDTEY ADRQLPVEGT VPSWLSGALI RNGPGRFEFG GKRATHWFDG LAMLRRYGFA DGTVSYTNRF LRTDAYAAAD TGHGAAEFAT GDDSFRRPLR WLRSLGPPEP TDNATVHVAQ LGEHFVALTE APRRIAFDPV TLETRGEFRW RDDIPEHLAT AHLQVDPNRE ETIGYSTEFG LSPMYHFYRI PNGRAGRRHV ATVPAAGPGY VHDCSITESH IVIVETPLRI AMAKALVPWT DGFLDLLEYD EAATTRFIVV DWDTESLAAT LETSPFFTFH HVNAYEDDDE LVLDLVAFDD DQIVRALTFD ALSEDGFAAA PDGRFVRFRL HPGEGRVRRS ERYDGGMELP TVPKPVRGRQ YRYAYAQATD RKGANGLVKL DVERGTATEW WERGVYVEEP RMVRRPGGTA EDDGVVIATA LDTKQERSML LVFDAETVVE RARAPLPHAV PFGFHGRFFP AV
|
| |