Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1516 |
Symbol | |
ID | 5773066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 1378731 |
End bp | 1380293 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641317167 |
Product | von Willebrand factor type A |
Protein accession | YP_001582850 |
Protein GI | 161529024 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.00796283 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAATCAG TTAAGCTTCA AAACGAATCA CTAGTAGAGA TTGCCACATT TCTTGTAAGA CGATGGTCTG AAAGAGACAA CATTGTTGTA GAAATCTCAG ACAAAACTGA AACAAAAACA AGACTAAAAG AAAACAAGGT AATTCTTACA CCGCTAGAGA AAAGAGTAGG AAACGATTTT CAAAAGTACA GACAGTTTAG AACATCACTA TGGTATGAGG CAATGAGAAT AAAGTTCTGC AAGAAAATTC TCAGCAATGA TCATGCATTT GGATTCATCC TAAACACAAT GGAGACAAGA CGTGTAGAGG AACTAGGAAG AAAGATTTGG AAAGGAATGG ATGATGAAAT CATCTTCAAT TACGCCTACA TGCTTGTGGC CAGACCTCAA TTGCACACAG TTTATGGAAA AGCAAGGATT GTTGAGGCAT TCTATCAATA TTTCATGTTT GGAGCAGTCA AGGGAGAAGT TCAGTCTAGT CATTTTGAAA AAATTAGAAA GGCAGATGCA TTTGCAAAAA AAATGGTAAG CAAAGCAATT GAAGAAAATC ACGACACAGA TTGGCTTGAA AAAAATGTCA GTGAAATCAT AAAAATTCTA GAGATTGATT CTCTACTAAC AATTCCAGTA TCACTACCAT TTATGAAAGC AGGAATGCCA CTTTCTGAAG AAGAACTGCT AAGAGTCTTG AAGATAGTTT CCAAAAACAA AGAAGGAGAC TTTGGCAAAG TAGATCCTTC TGCAATATTG AAGGGAGAAG ATGTAATTGA TGAGTATAAT GTTTTACTTG ATGAAGACAA GAAAACAGAG AACAAGGGAC TGATGCCTGA AGCAATAGGA ATCCAAATCC CAACTACAAG AAATGTAGAT GAGACTGTAA TCTATGACAT GAGTTTGATT AATGGACTAA AAACAAAATT CAAAGAATGG AAGACAGGTT GGAAGGAACA ACATGTCAGA TCAGGAGAAG AGTTTGATGA GGAAAACTAC ATTGAAGGAA ATGAACCATT CTTTACAGAT ATTAAAAAAT CAATCAAAAC AAAAATTGTC ATACTGTTAG ATCATTCATC TAGTATTTCG TCGGATGCAA TTGAATACAA AAAAGCAACG CTTGCACTTT GCGAAGTCTT GGCATATCTC AAAGTAAAAT TTGCAGTCTA TGCGTTTAGT ACAGAAAACA GATCAGTTGT TTGTTGGTCC ATAAAACCAG ACAACATGAA ATGGAATAAC GTTACTGCAA AAAGATTGGC ACAAATAGTT GCAAACGGTT CTACACCACT AGCTGAAGTG TATGACAAGA TGTTTCCAAT CTTACAATCA AAGAGACCAG ACATCCTCTT GACATTGACT GATGGTGAGC CATCAGACCC TGATGCAGTC AGAAACATGA CAAAATCACT CAAAAGTCTA GGCATAAGTA TGGTCGCCTT AGGCCTGGGA CCAAATACTG TAAGGGCAAC AACTATTGCA AACAATCTAA GACATTTGGG GTATGAAAAA ACAATGGCAG TAAGCCGTCT AAGAGATATT CCAAACAAGG TAATCAAGAT TTTAGATATC TAG
|
Protein sequence | MQSVKLQNES LVEIATFLVR RWSERDNIVV EISDKTETKT RLKENKVILT PLEKRVGNDF QKYRQFRTSL WYEAMRIKFC KKILSNDHAF GFILNTMETR RVEELGRKIW KGMDDEIIFN YAYMLVARPQ LHTVYGKARI VEAFYQYFMF GAVKGEVQSS HFEKIRKADA FAKKMVSKAI EENHDTDWLE KNVSEIIKIL EIDSLLTIPV SLPFMKAGMP LSEEELLRVL KIVSKNKEGD FGKVDPSAIL KGEDVIDEYN VLLDEDKKTE NKGLMPEAIG IQIPTTRNVD ETVIYDMSLI NGLKTKFKEW KTGWKEQHVR SGEEFDEENY IEGNEPFFTD IKKSIKTKIV ILLDHSSSIS SDAIEYKKAT LALCEVLAYL KVKFAVYAFS TENRSVVCWS IKPDNMKWNN VTAKRLAQIV ANGSTPLAEV YDKMFPILQS KRPDILLTLT DGEPSDPDAV RNMTKSLKSL GISMVALGLG PNTVRATTIA NNLRHLGYEK TMAVSRLRDI PNKVIKILDI
|
| |