Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1222 |
Symbol | |
ID | 3785561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1404161 |
End bp | 1406827 |
Gene Length | 2667 bp |
Protein Length | 888 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637811307 |
Product | von Willebrand factor, type A |
Protein accession | YP_411917 |
Protein GI | 82702351 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCGA GTGACGGGTA TGTCGTGGGC ACCACGAAGA TCGTTGATCA CGGCTCCGAT AGTGTCCGAT GGAATCTGGT CATCCTGGGT GATGGATATC GTGCGACCGA GTTGGCGCAG TATCATACCG ACGTACAGAA CTTTGTCACG GCCCTGCGCA CTACTCCGCC CCTGGACGAA CTCTTCTGCG GGATCAATGT ACACCGTGTC GACATCGTGT CGAATGAAAG CGGCGCGGAC GATCCGGGAT GCGCCGGAGG AACTCCAACC ACGGCCAATA CCTATTTTGA CGCGACATTT TGCAGCATCT TTGCCGGAAC ACCGCTCGAT CGTCTGCTCA CCGTGGATGC CGCGCTCGCG TTGTCCGTTG CGTCCGCCCA GGTCCCGCTC AGGCATCAGG TGCTTTGCAT AGTCAATTCG AGCAAGTATG GAGGATCAGG TGGAATGATC GCCACGTGCT CTACCCATGC CCAGGCGGCG CAGATAGCGA TCCACGAAAT GGGACACAGC GCTTTTGGAC TGGCCGATGA GTATGGAGGG AATGGCGCAG GCACACCGGC GGGAGAACCC TCTCAGCCGA ATGTAACACG CGATACCAGC CGCACCACTA ACAAGTGGCG CGACCTCATC GCGCCGGGCA CTCCCATGCC GTCACAATGC GACCCCGGTT GCGCGGCCTC CGTCTGCGTC CCACCCGCGA TGCCTCCCGC AGCGGGTGCG GTAGGCACCT ATGAAGGCGC AATTTACTCG GACTGCGACA CCTATCGCCC TCTGCCAAGC TGTTATATGC GGGATTACGC CCCTTTCTGT CCCGTCTGCG CAGGCGTAAT CCGGCAAACG CTGCAACTGT TTCTTCCCCC TGAATCGATC ACCCTGACCA CCCCGAGTAT AAACTTCCTG AATGTTCCCG CCGGGATGGG GGGTGTCGGT GTAACGACAC ACCGGGCCAT CCTATGGGAA GTGGTGTCCT GCCGAAGCCT GACATTTGAG ATGACCGCAG GCCCTACCGG CGGATTCGGT ACCCCGAATG GCACCTCCGT TGCTGTGGCG AGCGATCCGA TTCTGCCGGT TACCTACGCA CGGCTCTGGC TGTCCTACAC CTCCACGAAT CCAGGCGATA CTGCAAGCGG CAACGTTACC GTGCGTTGCG TCCAGACGGG GGAAAGCTGG GTAATCAACA TTGCTGCCAA CACGACTGCG CGTCCCCGTT CTGCAGTCGC GCTCGTGCTC GATCGCTCCG GCAGCATGAA TGAAGATGCT GGCGATGGCA TCTCCAAGGT CCAGAAACTG CGCGAAGCCG CCAATGTTTT TATCAGTGCC ATGCAACCGG CGGACGGTAT CGGCTTGGTC CGCTTCAACG AAGCGGCCCA GCGTCTGATG GAAATCCAGG AGGCTGGCGC TGCCCCGGGC GGTACGGGGC GGACCATTGC CCTTGAACAT ATTGCCGGAA GCGATATCGA TCCCGCTGGC GCCACTTCCA TCGGTGACGG TGTCGTGAAT GGAAAGCAGA TGCTGGATGA CGCGCAAGCG ACGGCGGGTA CGCCGTACGA TGTTACCGCT ATGGTCGTGC TCACCGATGG AATGTGGAAT CGTCCCCCGC CTCTAGCCGA CGTCATGGGC AGCATCACGG CAAATACCTA TGCGGTTGGC CTGGGACTGC CCTCCAACAT CAGCATTCCC GCACTGGCGA CGCTATGTCA GGTACATAAC GGATATCTTC TGGTGACCGG CGCACTTTCG GGCGACCAGC CCATGCGGCT CGGCAAGTAT TTCCTGCAAA TTCTCGCGGG CGTATCGAAT GCCCAGATCA CCGCCGATCC GCGGGGTATT CTCAGCAGGG AATCGGAGCA TCGTATCCCG GTTTCGATCT GTGAAGCGGA TTATGGAATG GATCTGATCG TATTGAGCCC TTTTCCGCGA GCGATCGATT TTCAGCTTGA AGCGCCGGAC GGTTCCAGTA TTACGCCAGC GTCACCACCG GGATCGACCA ATTCACGATT CATCCTGAGC CCGTATGCAT CCTATTACCG GTGCGCGTTG CCGGTGCTTC CAGCTCAGTC AGCCGGAAGC CATGCCGGGC AGTGGCATGC AATACTGAAA CTGCAGGGCG GAACCTCGGT ATCTGCTGCA CAGCACGCGG CGCTGCCTTA TGAGTTTGTC GCGCACATCT ATTCGAGCTT GACCTTCACA ACGTATGTCA GGCAAGCGAG TTTCAAGGTG GATACCATTA TTCACCTGAT CGCTGCACTG TACGAATATG ATGCGCCGCT TCAGGGAAAT GCAAGGGTCT GGGGGGAGAT AGTGCGACCG GATGGAGTTG CTGAGCTTAT CCCACTCAGC CGCGATCCAC AGGGGCAATT TACCGCTGCC TATCCACTCA AGGTGCAAGG CGTCTTCCAC ATCCGGGTGC GTGCACGCGG TGAAACGGCA CGGGGAACGC CTTTTGAACG GGAACGCACG CTTACTGCGG TTGCGACCCC GGGGGGTAAT GTATGGAATC CGAACGAACC AAAAGCCAAC GATTTCTGTA ACCTGCTGCA CTGCCTTCAG GAGAAGGATG TGATCAGCGA GGAACTCATA CACAGGCTGA AAGAACAGGG AATCGATGTG CCGACACTGT TGAAATGCCT GGATGAACGG TGTGGCGCCT GGATAAAACC CGAATAA
|
Protein sequence | MSASDGYVVG TTKIVDHGSD SVRWNLVILG DGYRATELAQ YHTDVQNFVT ALRTTPPLDE LFCGINVHRV DIVSNESGAD DPGCAGGTPT TANTYFDATF CSIFAGTPLD RLLTVDAALA LSVASAQVPL RHQVLCIVNS SKYGGSGGMI ATCSTHAQAA QIAIHEMGHS AFGLADEYGG NGAGTPAGEP SQPNVTRDTS RTTNKWRDLI APGTPMPSQC DPGCAASVCV PPAMPPAAGA VGTYEGAIYS DCDTYRPLPS CYMRDYAPFC PVCAGVIRQT LQLFLPPESI TLTTPSINFL NVPAGMGGVG VTTHRAILWE VVSCRSLTFE MTAGPTGGFG TPNGTSVAVA SDPILPVTYA RLWLSYTSTN PGDTASGNVT VRCVQTGESW VINIAANTTA RPRSAVALVL DRSGSMNEDA GDGISKVQKL REAANVFISA MQPADGIGLV RFNEAAQRLM EIQEAGAAPG GTGRTIALEH IAGSDIDPAG ATSIGDGVVN GKQMLDDAQA TAGTPYDVTA MVVLTDGMWN RPPPLADVMG SITANTYAVG LGLPSNISIP ALATLCQVHN GYLLVTGALS GDQPMRLGKY FLQILAGVSN AQITADPRGI LSRESEHRIP VSICEADYGM DLIVLSPFPR AIDFQLEAPD GSSITPASPP GSTNSRFILS PYASYYRCAL PVLPAQSAGS HAGQWHAILK LQGGTSVSAA QHAALPYEFV AHIYSSLTFT TYVRQASFKV DTIIHLIAAL YEYDAPLQGN ARVWGEIVRP DGVAELIPLS RDPQGQFTAA YPLKVQGVFH IRVRARGETA RGTPFERERT LTAVATPGGN VWNPNEPKAN DFCNLLHCLQ EKDVISEELI HRLKEQGIDV PTLLKCLDER CGAWIKPE
|
| |