Gene Information |
Locus tag | Nmar_1040 |
Symbol | |
ID | 5773932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 914012 |
End bp | 916327 |
Gene Length | 2316 bp |
Protein Length | 771 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641316682 |
Product | fibronectin type III domain-containing protein |
Protein accession | YP_001582374 |
Protein GI | 161528548 |
COG category | [C] Energy production and conversion |
COG ID | [COG3794] Plastocyanin |
TIGRFAM ID | |
|
|
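The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes a single terminal stop codon; both assumptions match the values in this record but are not stated by it. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends in exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields for Nmar_1040.
start_bp, end_bp = 914012, 916327

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 2316, matching "Gene Length"
print(protein_length(length_bp))  # 771, matching "Protein Length"
```

Because the gene lies on the minus strand, the listed sequence is the reverse complement of the replicon over this interval; the length calculation is unaffected by strand.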
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTCAT TTTTCACTAT GAATATGAAT AAAAAAATAT TTTTTTTATT TCTTTCATTA TCTGTATTAT TTTCATTTAT GTTAAGTGAA GATGCATTTG CTCAAACTAA ACCCGATAGA GTGAGGGGAC TTTCTGCAAC TGCAATCAGT ACGACCCAGA TTGACTTGTC TTGGAATGAA CCTTCTGACG GAGGATTGCC AATAACAGGA TACCAAATTG AACGCAAAAA AGCATCAGAC CCTTGGGAAA TCTACATTGC TGATACTGGC AATACAAACA CAACATATTC TGATCAAGGT TTAGATCCTG ATACAAGATA TCGTTACAAA GTTGCTGCAA TTAACGCAAT TGATATTGGT CGTTCATCCA CTGCCAAAGC TGCTACAACT TTTGCAATAA CAGAACCAAA TAGAGTAACA GGACTCTCGG CCACTACAAT CAGTCCAACA CAAATTGACT TGTCTTGGAA TGAACCATAC GATGGTGAGT CTCCAATAAC AGGATACCAA ATTGAACGCA AAAAAGCATC AGACCCTTGG GAAATCTACA TTGCTGATAC TGGCAATACA AACACAACAT ATTCTGATCA AGGTTTAGAT CCTGATACAA GATATCGTTA CAAAGTTGCT GCAATTAATG CAATTGGGAT TGGAACTGCA TCCACTGCCA AAGCTGCTAC AACTACTGAG ATAACAGAAC CCGATAGAGT AACAGGACTT ACTGCGACTG CAATTAGCCA TACTCAGATA AACTTGTCTT GGAATGAACC ATACGATGGT GAGTCTCCAA TAACAGGATA CCAAATTGAA CGCAAAAAAG CATCAGACCC TTGGGAAATC TACATTGCTG ATACTGGCAA TACAAACACA ACATATTCTG ATCAAGGTTT AGATCCTGAT ACAAGATATC GTTACAAAGT TGCTGCAATT AACGCAATTG ATATTGGTCG TTCATCAACT ATTGTAACTG AGACTACGTT ACTACCTGGT GTTTATTTGC CCCCTGTTGA TACCTCTACA GGCACGTCAA AAATTAGACC TCCTCCACAA ATTCAAGGAA TTGGTCTTTA CAAATTCACA ACACATGTTG GTGATGATGG TGATACCAAA GAATATGATG CTCCTCTTAA TCTCCCATTT GATCAATATT TCCCATACAG TAAATTTTCA GATGAAACTG ATTTTAAGAA TTACAAAGAA ATGGGAGCGT ATAAAAAATT AGGACTATAT CATGATTTTT CTAAAAAAAC ACTAGCTCCT AAATTCTTTG CAGAAACAAA TCAACCAGTT CAACTTCAAA TTCGTTTATG GGATATGCTT TCAAGTTCAA AAATTGAACA TCTTTCATTA TACACTTATT CTACATCCTC TACAACTGTA GAAAACAGTG ACGTTGAAAT TATTTTTGAT AAAGGAAAAC CTCTTGATGT TATAGATCCT AATGATATTT TCAAGTATGT TGAAGTATAT CCTTCTTATG AAGATGAATG GTTATGGATT AATCTTGATT TGATGTTTCA AAAGCCTATG ACTTCATCTA ATATTCTACT GCAATCCTGG CATGAATCTA GAATCCCTTC ATTTGTTCAA GTTGATGATA TTTGGGAAAT CTCTAATCCT CAATCAAATA CTGATCATGT TGATGAGGTT AATCTTACTG AAGAAGTTGA AATTACTCAT GGTACATCAA ATCCCACATG TAAAATGGAT GATTCATGTT TTACTCCCTC CGATGCAAAA ATCTTAGAAG GGGGAATTAT AACTTGGTTT AACACCGATT CTTTTACACA CACTGTAACT 
AGTGGTTCTG TTAATAATAA TGATAATAGA TTTGGATACA TTTTGTTTCC TGGTCAAACA GTACAACATG AATTTCCTTA CAAAGGAATC TATGATTATT ATTGTGCACT TCATCCTTGG GCTAATGGCT CTGTCATTGT ATATGGTGCA GATTTTGAAA AACCAGAGAC TAGTTTTGAT GAATCACAAC CAACATTACT TGTAAAATCT ACCTCTGGGG GCTCGTTAAT AATTGAAAAT AATGATGTGT ATGTTACTTC GTCAAGGGAT CTCCACATGA ATATCTCTGG ACATATTCAA GAACTATCTA CATCAAATAC TGTTAAAATC ATAATTATTC ATCCAGATAA AATCACAAAA CACATGACTG CTATTGTAAA TAGTGATGGA TTTTATTCCA TGCCTGTAGT TCTTAACAAA CACTGGATGG AAGGAACCTA CGAAATAATT ACTGAATATC GTGGTGAACA AGTGGCACAA TTATCTTTTC TAGTATCTGA CAAACCTGTA AGATGA
|
Protein sequence | MRSFFTMNMN KKIFFLFLSL SVLFSFMLSE DAFAQTKPDR VRGLSATAIS TTQIDLSWNE PSDGGLPITG YQIERKKASD PWEIYIADTG NTNTTYSDQG LDPDTRYRYK VAAINAIDIG RSSTAKAATT FAITEPNRVT GLSATTISPT QIDLSWNEPY DGESPITGYQ IERKKASDPW EIYIADTGNT NTTYSDQGLD PDTRYRYKVA AINAIGIGTA STAKAATTTE ITEPDRVTGL TATAISHTQI NLSWNEPYDG ESPITGYQIE RKKASDPWEI YIADTGNTNT TYSDQGLDPD TRYRYKVAAI NAIDIGRSST IVTETTLLPG VYLPPVDTST GTSKIRPPPQ IQGIGLYKFT THVGDDGDTK EYDAPLNLPF DQYFPYSKFS DETDFKNYKE MGAYKKLGLY HDFSKKTLAP KFFAETNQPV QLQIRLWDML SSSKIEHLSL YTYSTSSTTV ENSDVEIIFD KGKPLDVIDP NDIFKYVEVY PSYEDEWLWI NLDLMFQKPM TSSNILLQSW HESRIPSFVQ VDDIWEISNP QSNTDHVDEV NLTEEVEITH GTSNPTCKMD DSCFTPSDAK ILEGGIITWF NTDSFTHTVT SGSVNNNDNR FGYILFPGQT VQHEFPYKGI YDYYCALHPW ANGSVIVYGA DFEKPETSFD ESQPTLLVKS TSGGSLIIEN NDVYVTSSRD LHMNISGHIQ ELSTSNTVKI IIIHPDKITK HMTAIVNSDG FYSMPVVLNK HWMEGTYEII TEYRGEQVAQ LSFLVSDKPV R
|
| |