Gene Nmul_A1222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1222 
Symbol 
ID3785561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1404161 
End bp1406827 
Gene Length2667 bp 
Protein Length888 aa 
Translation table11 
GC content58% 
IMG OID637811307 
Productvon Willebrand factor, type A 
Protein accessionYP_411917 
Protein GI82702351 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGCGA GTGACGGGTA TGTCGTGGGC ACCACGAAGA TCGTTGATCA CGGCTCCGAT 
AGTGTCCGAT GGAATCTGGT CATCCTGGGT GATGGATATC GTGCGACCGA GTTGGCGCAG
TATCATACCG ACGTACAGAA CTTTGTCACG GCCCTGCGCA CTACTCCGCC CCTGGACGAA
CTCTTCTGCG GGATCAATGT ACACCGTGTC GACATCGTGT CGAATGAAAG CGGCGCGGAC
GATCCGGGAT GCGCCGGAGG AACTCCAACC ACGGCCAATA CCTATTTTGA CGCGACATTT
TGCAGCATCT TTGCCGGAAC ACCGCTCGAT CGTCTGCTCA CCGTGGATGC CGCGCTCGCG
TTGTCCGTTG CGTCCGCCCA GGTCCCGCTC AGGCATCAGG TGCTTTGCAT AGTCAATTCG
AGCAAGTATG GAGGATCAGG TGGAATGATC GCCACGTGCT CTACCCATGC CCAGGCGGCG
CAGATAGCGA TCCACGAAAT GGGACACAGC GCTTTTGGAC TGGCCGATGA GTATGGAGGG
AATGGCGCAG GCACACCGGC GGGAGAACCC TCTCAGCCGA ATGTAACACG CGATACCAGC
CGCACCACTA ACAAGTGGCG CGACCTCATC GCGCCGGGCA CTCCCATGCC GTCACAATGC
GACCCCGGTT GCGCGGCCTC CGTCTGCGTC CCACCCGCGA TGCCTCCCGC AGCGGGTGCG
GTAGGCACCT ATGAAGGCGC AATTTACTCG GACTGCGACA CCTATCGCCC TCTGCCAAGC
TGTTATATGC GGGATTACGC CCCTTTCTGT CCCGTCTGCG CAGGCGTAAT CCGGCAAACG
CTGCAACTGT TTCTTCCCCC TGAATCGATC ACCCTGACCA CCCCGAGTAT AAACTTCCTG
AATGTTCCCG CCGGGATGGG GGGTGTCGGT GTAACGACAC ACCGGGCCAT CCTATGGGAA
GTGGTGTCCT GCCGAAGCCT GACATTTGAG ATGACCGCAG GCCCTACCGG CGGATTCGGT
ACCCCGAATG GCACCTCCGT TGCTGTGGCG AGCGATCCGA TTCTGCCGGT TACCTACGCA
CGGCTCTGGC TGTCCTACAC CTCCACGAAT CCAGGCGATA CTGCAAGCGG CAACGTTACC
GTGCGTTGCG TCCAGACGGG GGAAAGCTGG GTAATCAACA TTGCTGCCAA CACGACTGCG
CGTCCCCGTT CTGCAGTCGC GCTCGTGCTC GATCGCTCCG GCAGCATGAA TGAAGATGCT
GGCGATGGCA TCTCCAAGGT CCAGAAACTG CGCGAAGCCG CCAATGTTTT TATCAGTGCC
ATGCAACCGG CGGACGGTAT CGGCTTGGTC CGCTTCAACG AAGCGGCCCA GCGTCTGATG
GAAATCCAGG AGGCTGGCGC TGCCCCGGGC GGTACGGGGC GGACCATTGC CCTTGAACAT
ATTGCCGGAA GCGATATCGA TCCCGCTGGC GCCACTTCCA TCGGTGACGG TGTCGTGAAT
GGAAAGCAGA TGCTGGATGA CGCGCAAGCG ACGGCGGGTA CGCCGTACGA TGTTACCGCT
ATGGTCGTGC TCACCGATGG AATGTGGAAT CGTCCCCCGC CTCTAGCCGA CGTCATGGGC
AGCATCACGG CAAATACCTA TGCGGTTGGC CTGGGACTGC CCTCCAACAT CAGCATTCCC
GCACTGGCGA CGCTATGTCA GGTACATAAC GGATATCTTC TGGTGACCGG CGCACTTTCG
GGCGACCAGC CCATGCGGCT CGGCAAGTAT TTCCTGCAAA TTCTCGCGGG CGTATCGAAT
GCCCAGATCA CCGCCGATCC GCGGGGTATT CTCAGCAGGG AATCGGAGCA TCGTATCCCG
GTTTCGATCT GTGAAGCGGA TTATGGAATG GATCTGATCG TATTGAGCCC TTTTCCGCGA
GCGATCGATT TTCAGCTTGA AGCGCCGGAC GGTTCCAGTA TTACGCCAGC GTCACCACCG
GGATCGACCA ATTCACGATT CATCCTGAGC CCGTATGCAT CCTATTACCG GTGCGCGTTG
CCGGTGCTTC CAGCTCAGTC AGCCGGAAGC CATGCCGGGC AGTGGCATGC AATACTGAAA
CTGCAGGGCG GAACCTCGGT ATCTGCTGCA CAGCACGCGG CGCTGCCTTA TGAGTTTGTC
GCGCACATCT ATTCGAGCTT GACCTTCACA ACGTATGTCA GGCAAGCGAG TTTCAAGGTG
GATACCATTA TTCACCTGAT CGCTGCACTG TACGAATATG ATGCGCCGCT TCAGGGAAAT
GCAAGGGTCT GGGGGGAGAT AGTGCGACCG GATGGAGTTG CTGAGCTTAT CCCACTCAGC
CGCGATCCAC AGGGGCAATT TACCGCTGCC TATCCACTCA AGGTGCAAGG CGTCTTCCAC
ATCCGGGTGC GTGCACGCGG TGAAACGGCA CGGGGAACGC CTTTTGAACG GGAACGCACG
CTTACTGCGG TTGCGACCCC GGGGGGTAAT GTATGGAATC CGAACGAACC AAAAGCCAAC
GATTTCTGTA ACCTGCTGCA CTGCCTTCAG GAGAAGGATG TGATCAGCGA GGAACTCATA
CACAGGCTGA AAGAACAGGG AATCGATGTG CCGACACTGT TGAAATGCCT GGATGAACGG
TGTGGCGCCT GGATAAAACC CGAATAA
 
Protein sequence
MSASDGYVVG TTKIVDHGSD SVRWNLVILG DGYRATELAQ YHTDVQNFVT ALRTTPPLDE 
LFCGINVHRV DIVSNESGAD DPGCAGGTPT TANTYFDATF CSIFAGTPLD RLLTVDAALA
LSVASAQVPL RHQVLCIVNS SKYGGSGGMI ATCSTHAQAA QIAIHEMGHS AFGLADEYGG
NGAGTPAGEP SQPNVTRDTS RTTNKWRDLI APGTPMPSQC DPGCAASVCV PPAMPPAAGA
VGTYEGAIYS DCDTYRPLPS CYMRDYAPFC PVCAGVIRQT LQLFLPPESI TLTTPSINFL
NVPAGMGGVG VTTHRAILWE VVSCRSLTFE MTAGPTGGFG TPNGTSVAVA SDPILPVTYA
RLWLSYTSTN PGDTASGNVT VRCVQTGESW VINIAANTTA RPRSAVALVL DRSGSMNEDA
GDGISKVQKL REAANVFISA MQPADGIGLV RFNEAAQRLM EIQEAGAAPG GTGRTIALEH
IAGSDIDPAG ATSIGDGVVN GKQMLDDAQA TAGTPYDVTA MVVLTDGMWN RPPPLADVMG
SITANTYAVG LGLPSNISIP ALATLCQVHN GYLLVTGALS GDQPMRLGKY FLQILAGVSN
AQITADPRGI LSRESEHRIP VSICEADYGM DLIVLSPFPR AIDFQLEAPD GSSITPASPP
GSTNSRFILS PYASYYRCAL PVLPAQSAGS HAGQWHAILK LQGGTSVSAA QHAALPYEFV
AHIYSSLTFT TYVRQASFKV DTIIHLIAAL YEYDAPLQGN ARVWGEIVRP DGVAELIPLS
RDPQGQFTAA YPLKVQGVFH IRVRARGETA RGTPFERERT LTAVATPGGN VWNPNEPKAN
DFCNLLHCLQ EKDVISEELI HRLKEQGIDV PTLLKCLDER CGAWIKPE