Gene Information |
Locus tag | Nmar_0764 |
Symbol | |
ID | 5773405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 685965 |
End bp | 687782 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641316401 |
Product | CBS domain-containing protein |
Protein accession | YP_001582098 |
Protein GI | 161528272 |
COG category | [P] Inorganic ion transport and metabolism; [T] Signal transduction mechanisms |
COG ID | [COG0168] Trk-type K+ transport systems, membrane components; [COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains |
TIGRFAM ID | |
|
|
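The coordinates and lengths listed in the Gene Information table are mutually consistent, and a short calculation can confirm this. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates on the replicon, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(685965, 687782)   # Start bp / End bp from the record
    print(length, "bp")                    # 1818 bp, matching "Gene Length"
    print(protein_length(length), "aa")    # 605 aa, matching "Protein Length"
```

The strand ("-") does not affect this arithmetic; it only determines which strand of NC_010085 the coding sequence is read from.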
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0514021 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGACAA ATCCAGAACG CGCCGTCTCA TATGTGTTAA ACAAATCTGT AACTGAATAC ATGGACAAAG ATGTCCTGAT TCTAAGTCAG AACACACTAA CAAGAGAAGC AACAAGAATG TTGCAACATT ATGAAACAGA CGACATTATT GTTACAAATG AAGACAGAGT GCCAGTAGGA ATTGTTACAG ATGAAGATAT TCTAAGTAAA GTAAGTGATG TTACAGTATA TGCGGAAGCT ACAAAATTAA AAGACATCAT GACGACTCCG CTTGTCACAA TTAATGAAAA AGCAACACTG CAAGATGCAT TACACAAAAT GAGAGATAAC AGTATTAGAA AATTACCAGT ATTGTCAAAA AAGAATCAAG TCATTGGGAT GATTTTCCAA ACGACTATTG CAAATGTAAT CAGAGATGCA ACTGCATCTG AACCTCGATT ATTGAGTCCT CCAGTAAAAG CAGTACTTGG TAATTTAGGA TTTGTTTTGC AGTTTGCAGG GGTCCTGTTA CTGGTCCCAG CAATTATTTC TACAATTTTA GAGAATACAC TTACTGCTAC AGGGATTTAT CTTACAACGG TTCTATTGTT AGTTACAGGA TTCTTTTTGA ATTCATATGG TGAAAAATCA AGTCTGAATT TGCAGCAAGC GTCGATATTG GTGTTTTCAA GCCTGTTTTT GTTGTCTTTG TTTGGAACTG TACCGTATCT GTACGTGTTT CCAAGTGAAG AGACCAATGT AGAGGTATTT GGAAATGCAT TCTTTTCAAG TGCAGCAGGG TTTACCACAG GAGGAATATC CCTATTTGAT ACCCCTGAAG AACAACTTAC TCAAAGTTTT ACATTCTATC GTAGCTACAC GCAACTCGTA GGAGGAATGA GTTTCATTTA CCTCGTAATT ACAGCGTTTT ATCCAGAATC AAAATTACAA TCAATGAGAG GTTTTATTTC AGGAAGGACG CTTCACATGA AAGAACTCTT CTCAACAATT ACAGTCATCT TTGCAGTATA CATTGTCATT GTTGCAATAT TGTTGTATTT CTTTGGACAA GAGAATCTAT TAGACGACTT TTCGCTAGCA ATGAGTACAC TTGCAACAGG AGGATTTGTC CCGTCTTCAA CAATCATTGA GAATTTAGGA TGGCAAGAAG AAGTAATTTT GATGGGAGCC ATGATACTTG GTGCATTACC ATTTACATTC CACTATGCAT TTGTAAGAAA GAAATTCCTT GCACCAAAAT TAGGAAAAGA AGTTCTCACA TACTTTGCAA TTTTAGGTGG CGCAACAATA TTGTTTATCT CAATTAGCGG ATTAGACCCA TTGGATAGTG CATTTTATTC TGTTTCTGCA AGTACCACAG CAGGTCTTCA ACTACAAAGT TTAGCAGGAT TAGGAGGATT TGCACATGCA ATTTTGATAA CTTTGATGTT CATTGGAGGG TGTGGATTTT CAACTGCAGG AGGATTGAAA ATTTTCAGAT TATTCCATCT CAGAAATTGT AGATCATTTT TCAGCAGTGT AAGAAGAAAA GAACTCTCAA CACAAACAAA AAAAGAGATT ACATCAACAT TAATCATCAT AGCATTATTC CCAGTAATCT CAGCAATAAC AGGATTGCAT CTTGCTGAGA CTGAAGATGT GTCATATCAG GATGCATTCT TTGAGGCTGC AGGAGTGATT ACCACAGGAG GATTGTCAGC AGGGGTAATT GATTCAGATA CAGATCCTGC AACAAAAATT GTCTTGGGAT TTTTAATGAT ATTTGGAAGG CTTGAGATAA TTGCAATTAT CTATATTTTT 
GTGCCCAGAT TAAGTTAA
|
Protein sequence | MSTNPERAVS YVLNKSVTEY MDKDVLILSQ NTLTREATRM LQHYETDDII VTNEDRVPVG IVTDEDILSK VSDVTVYAEA TKLKDIMTTP LVTINEKATL QDALHKMRDN SIRKLPVLSK KNQVIGMIFQ TTIANVIRDA TASEPRLLSP PVKAVLGNLG FVLQFAGVLL LVPAIISTIL ENTLTATGIY LTTVLLLVTG FFLNSYGEKS SLNLQQASIL VFSSLFLLSL FGTVPYLYVF PSEETNVEVF GNAFFSSAAG FTTGGISLFD TPEEQLTQSF TFYRSYTQLV GGMSFIYLVI TAFYPESKLQ SMRGFISGRT LHMKELFSTI TVIFAVYIVI VAILLYFFGQ ENLLDDFSLA MSTLATGGFV PSSTIIENLG WQEEVILMGA MILGALPFTF HYAFVRKKFL APKLGKEVLT YFAILGGATI LFISISGLDP LDSAFYSVSA STTAGLQLQS LAGLGGFAHA ILITLMFIGG CGFSTAGGLK IFRLFHLRNC RSFFSSVRRK ELSTQTKKEI TSTLIIIALF PVISAITGLH LAETEDVSYQ DAFFEAAGVI TTGGLSAGVI DSDTDPATKI VLGFLMIFGR LEIIAIIYIF VPRLS
|
| |
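The relationship between the gene sequence and the protein sequence above can be spot-checked by translating a few codons. The sketch below is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is given in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in which start codons are permitted, not in amino acid assignments). The names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":          # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in the record.
    print(translate("ATGTCGACAA ATCCAGAACG CGCCGTCTCA"))  # MSTNPERAVS
    # Last 18 bases of the gene sequence, including the TAA stop codon.
    print(translate("GTGCCCAGAT TAAGTTAA"))               # VPRLS
```

Both results match the start (MSTNPERAVS) and the end (VPRLS) of the listed protein sequence. The same function can be applied to the full 1818 bp sequence once the two wrapped lines are concatenated.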