Gene Information |
Locus tag | Nmar_0986 |
Symbol | |
ID | 5773305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 861646 |
End bp | 863085 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 641316625 |
Product | TPR repeat-containing protein |
Protein accession | YP_001582320 |
Protein GI | 161528494 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
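The Gene Length and Protein Length fields follow from the coordinates listed above. A minimal Python sketch of that arithmetic, assuming 1-based inclusive coordinates (consistent with the values in this record) and a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(861646, 863085)
print(length, protein_length(length))  # 1440 479
```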
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.461111 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAAGTA ATTTTAAAAA AATTTCTTGT CTTTTAGCAA TTATCATACT TTGTACACAA AGTTTTACAG GAATTTCGTT TGCTCAAATT AATCAAGAAC CTGAAGTGAT GTTTAATCAA GCAACAGAAT TGTTTCACAA TGGCGAGTAC AAGGAAGCAA TATCAATTTA TGATGACATA CTAGAGATTG CGCCAAACAA CATTTCCACA TTAAAAATGA AAGGAATAGC TCAGAGTAAT CAAGGGGATC ATAAAAAATC GCTAGAACAA TTCTTTACAG TTTTACAACA CCGACCAAAT GATGTCATAT CATTAGCTGG AATGGGTGTA GGGTTAGGAT ATTTAGGTGA ATATCAAGAG GCAACATCAT ATTTTGAAAA GGCACTCAAA GAAAAGCCAA ATAGCATAGT CATACAAAAT TACATGGAAT TTATCAATAA TGTAATTACG AAATATCCAT ACACACCTAC AGAAAAGCCT GATGGGTTAG ACGGAAAAAC AACCGCATCC ATTCCAGATT GGGTAAAACC GATAGCAAAA TGGTGGTCAA CAAATAGTAT TGATGATGCA GAGTTTGTTT CAGCATTAAT GTTTATGATT AATAATAAAA TTATTGAAAT TCCTCCAGTC GAGACACAAG AAGTTTCTGA AGAAAAAATC CCAGAGTGGA TAAAGAACAA TGCAGGGTGG TGGGCAGATG GAGAAATTGA CGATGATGCA TTTGTTCAAG GAATACAATA CATGATAGAA AAAGGATTAA TCATAATCAA AGTTGAAGAA GCAACACAAA AAACACAAGA GGAGTTAGAT CATGAATTTT ATCTGTTTGA AAGATATCTA AGAGACATTT CAAACAACAT ATCAAAAGAA AAGCGTTACA TAGAATTTCC AAACCCTAGT CAAGATGTAA TCAAAAAATT TCTCAGAGAT TATGTAAAAT GGAATTTTGA AGAAGAAGTA AAGAAAGCAT CTAGTAAATT CCCAGATCCC ACATACGAAA TTATTGATGA CACATACATT GTATATTACA AAGTGTACAT TAATGAGCAA CCCTCTGGAT TGCCACTAGA TCACGTCAGC ACATTAACTG ACTCATTTGC ATTTTGGGAA CAACAAGAAT TGTCAGTTAA TAATCAAAAA TTAAAAATAA AATTCGAGGT AACAAACTTG AAGCATGAAG CAAATGTTTG GGTAACATGG GTAGTTCGAA ACCTTGGTGA AGGGGTTTTA GGACATGCAC ATCTTGGTAA AGGAATTGTC GAAGTAGCAT TAGGTGATTT CAATTGTGAT GGAAGTTTTC AATTGTATGA TGTAGATAGT GTAAGAACAA TCATGACACA TGAGTTAGGT CATTCTATAG GTTTGAAACA TGTCGAGGAT AGAACAAGTA TAATGTATCC ATCATTCAAT CCTTCATACG CATATTGTTT GCTAAGTTAA
|
Protein sequence | MLSNFKKISC LLAIIILCTQ SFTGISFAQI NQEPEVMFNQ ATELFHNGEY KEAISIYDDI LEIAPNNIST LKMKGIAQSN QGDHKKSLEQ FFTVLQHRPN DVISLAGMGV GLGYLGEYQE ATSYFEKALK EKPNSIVIQN YMEFINNVIT KYPYTPTEKP DGLDGKTTAS IPDWVKPIAK WWSTNSIDDA EFVSALMFMI NNKIIEIPPV ETQEVSEEKI PEWIKNNAGW WADGEIDDDA FVQGIQYMIE KGLIIIKVEE ATQKTQEELD HEFYLFERYL RDISNNISKE KRYIEFPNPS QDVIKKFLRD YVKWNFEEEV KKASSKFPDP TYEIIDDTYI VYYKVYINEQ PSGLPLDHVS TLTDSFAFWE QQELSVNNQK LKIKFEVTNL KHEANVWVTW VVRNLGEGVL GHAHLGKGIV EVALGDFNCD GSFQLYDVDS VRTIMTHELG HSIGLKHVED RTSIMYPSFN PSYAYCLLS
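The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below is one possible way to do that in plain Python, not the database's own method. It assumes that translation table 11 assigns the same residue to each codon as the standard code (the two differ in permitted start codons), and the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Residues in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
# Assumption: table 11 shares these codon-to-residue assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGCTAAGTA ATTTTAAAAA AATTTCTTGT"))  # MLSNFKKISC
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) gives a quick consistency check for the record.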