Gene Information |
Locus tag | Nwi_1535 |
Symbol | |
ID | 3676560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter winogradskyi Nb-255 |
Kingdom | Bacteria |
Replicon accession | NC_007406 |
Strand | - |
Start bp | 1676665 |
End bp | 1681884 |
Gene Length | 5220 bp |
Protein Length | 1739 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637713089 |
Product | extracellular alpha-helical protein |
Protein accession | YP_318148 |
Protein GI | 75675727 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
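The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive positions, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, excluding one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag Nwi_1535.
length_bp = gene_length(1676665, 1681884)
print(length_bp)                  # 5220
print(protein_length(length_bp))  # 1739
```

Both results agree with the Gene Length (5220 bp) and Protein Length (1739 aa) rows of the record.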
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.232844 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.84967 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGGC TGGTTCGTGC CGTAGCCTTT TGCGCGACCT TGCTCATCGG GCTGGGATCG GTCGCCCATG CGGCGGATAA AGCTTTCAAG CGCGACGATC TGGCCGATGC GGCGATCAAG CTCGAAGCCC GGATCAAAAG TGAAGCCGGC GCGGTCGCCA AATCAGGCTC CATGCTGCGT ACCGACGCCG ACACGGCCTT CAAACGAGCC GATTTTCGCA GTGGCCTTCA AATACTCGGC CAGATCGCAA CCGTTGCACC GAACGATAGC GGCAACTGGC TACGCCTGGC GCGGACCATC TTCCAGATCA AACCCGCCAC AAGTCAGGAA CAGACCTTCC TGCTGGAGCG CGCGTCCACC GCTGCCTATC TCGCCTACCA GCGGGCGACC GATCCGGGGA CCGAGGCTGA TGCATTGGCG GTTCTCGGTC GCGCCTTTTC GGATCGAAAG CTGTGGCGGC CGGCCCTCGA CGCGTTGCGG CTCTCTCTCG ATCTGCGCGA AGTGGCCAGC GTTCGCGAGC AATACGAAAA AATGCGGGAC GACCACGGCT TCCGGCTGCT CGATTACACG GTAGATTCGG ATTCGGCGAC GCCTCGGGTC TGTTTTCAGT TCTCCGAGGA TCTTGCCAAG AGGGTGGACT TCTCGCCCTT CGTCGCGCTC GCGGGAGACG ACCGGCCGGC GCTGTCCTCG GAGGACAGGC AACTTTGTGT CGAGGGTCTC AAGCACGGTG AGCGCTACAA TATCAACCTG CGCGCCGGCC TGCCGTCGGC CGTGAAGGAA AGTCTGCCGA AATCCGCTGA ATTCAACGTC TATGTGCGCG ACCGCAAGCC GTTCGTCCGC TTCACCGGAC GAGCCTACGT GCTGCCGCGC ACCGGCCAGC GCGGCATCCC GCTTGTCAGC GTCAACACCC AGAACGTGAC CGTCAAGGTG TTCCGGATCG GCGACCGCAA CCTCATCAAC ACCGTTGTCG AGAGCGACTT TCAGAAGGCG CTCGGCAGCC ATCAGTTATA TGAGCTTGGC CACGAACGCG GGATCAAGGT CTGGTCGGGC GAGGTGACGA CGGCCTCCAC CCTGAATGCC GATGTGACGA CGGCGTTTCC GGTCGATGAG GCGCTCGGAA ACCTCCAGCC CGGCGTTTAC GTGATGACGG CCGCGCCCAA GGGTCCGGGT TCGACGGATG ACGACGAGTC CGGCTCGCTG GCGACCCAGT GGTTCATCGT CTCGGACCTC GGCCTGAGCG CCTATTCAGG CAATGACGGC ATCCACGTCT TCGTCAATTC GCTGGCGACA ACCGATGCCG TGGACAAGGC GGAAGTCCGC CTGGTCGCCC GCAACAATGA AATCCTGGCA ACCCGGAAGA CGGATGCATC CGGGCACGCG CTATTCGAGC CGGGGCTGGC GCGAGGCGAA GGAGGGCAGT CGCCCGCGTT GCTGACAGTC AGCACCGACA AAGCCGATTA TGCGTTCCTC AGCCTGAAGT CGAATGCATT CGATCTGACC GATCGCGGCG TCTCGGGCCG CGCCGTGCCA TCGGGAGCCG ATGCCTTCGT TTATGCGGAG CGCGGCGTCT ATCGCTCGGG GGAGACCGTG TATCTCACGG CGTTGCTGCG GGACGGGCAG GGCGTCGCCG TGACCGGCGG CCCCCTGACG CTGGTGGTCG AACGGCCTGA TGGCGTCGAG TACCGCCGCG CAGTCCTGTC GGATCATGGC TCGGGCGGAC GAAGCCTGGA CCTGCCGCTC AATTCAGCGG TCCCGACCGG AACATGGCGG GTTCGCGCGT TTACGGACCC CAAAGGACCA 
TCCGTCGGCG AAACCACCTT CATGGTCGAG GACTATGTCC CGGACCGGAT CGAATTCGAT CTGACTACCA AGGCGAAACA GATTGCCGCT GATAATCCGG TAGAACTCAA GGTCGACGGC CGTTTCCTCT ACGGCGCTCC CGCGTCCGGC CTGCAGCTCG AAGGCGAGTT ACTGGTTGCG CCTGCCGAGA GCCGTCCAGG CTATGCGGGG TATCGGTTCG GCGTGCCCGA TGACGAGGCC GCCAGTAACG AGCGCACACC GATCGAAAAC CTGCCCGAAG CCGATGCGAA CGGCGTCGCC ACGTTTCCCG TCAGCCTCGC GACGGCCCCG TCATCGGACC GGCAGCACGA AGCGCAGATT TTCATTCGCA TGGCGGAGGC GGGCGGCCGC GCGGTCGAAC GGAAGATCGT GCTTCCGGTG AAACCGTCCG CCGCCATGAT CGGCGTCAAA CCGTTGTTCG CCGACAAGAG TGTCGCGGAG GGGGACAGGG CCAGATTCGA CGTCGTCTTT GTGGCGCCGG ACGGAACGTC GCTTGCCCGC AAGGGTCTGC GCTACGAACT GCTCAAGCTG GAGAGCCGCT ATCAGTGGTA TCGGCAGAAC TCGTACTGGG AATACGAGCC GGTGAAATCG ACCAGACGGG TGGCCGATGG CGACCTCTCG ATCGCCGCCG ACAGTCCCGC GCGGATCGAA CTCTCCCCGC AGCCGGGGCG TTACCGGCTC GACGTCAAAT CCTCCGATTC AGACGGCCCG CTGACATCTG TTCAGTTCGA CGTCGGCTGG TATTCCGACG GCAGCGCCGA CACGCCTGAC CTGCTGGAAA CCTCGATCGA CAAGCCGGAT TACCAGTCCG GCGACACCAT GGTCGTTTCG GTTAACGCCC GGACCGCGGG CAAGCTCACG ATCAACGTTC TAGGTGACCG GCTGCTGACC ACCCAAACCA CCGAGGTCAA GGAAGGCACG TCGCAGGTCA AGATCCCGGT CGGCAAGGAC TGGGGCACCG GGGCCTATGT GGTCGCGACG CTGCGGCGGC CGCTCGACGT TGCCGCGCAG CGGATGCCCG GCCGCGCGAT CGGCATCAAA TGGTTCGGCA TCGACAGGAG CGCGCGCACC CTTTCGGTCA ACCTCTCGCC GCCGGAACTG GCGCGACCAT CCGCACCGCT TAAGCTGCCT GTGAAGGTCG GCGGTCTAAG CCCCGGCGAA GACGCCAAGA TCGTCGTCGC CGCCGTTGAC GTCGGCATCC TCAATCTCAC CAACTACAAA CCGCCAGCGC CCGACGACTA CTATCTGGGC CAGCGTCGCT TGACGTCTGA AATCCGTGAT CTCTACGGAC AACTGATCGA CGGGATGCAA GGGACGCGCG GCCAGCTCAG GACCGGCGGC GATTTCGCGG GAGCGGAGTT GCAGGGCAGC CCGCCGACGC AGAAGCCGCT CGCGCTCTAT TCGGGCATCG TCACGGTCGC CGCGGACGGC ACGGCCGAAA TCAGCTTCGA CATTCCGGAG TTCGCGGGCA CGGCGCGCGT GATGGCGGTC GCCTGGACCG CGACCAAGCT CGGGCGTGCG ACGGTCGATG TCACGGTGCG TGATCCGGTG GTGCTGACGG CGACCCTGCC GCGCTTTCTG TTGACCGGCG ATCGGGGCAC GATGAGCTTC GATCTCGACA ATGTCGAAGG TCCGCCAGGA GATTATACCG TCAACGTCAG AACGTCCGGG CCGGTGAAGG TAGCGGGCAA TGCCACGACC GCGATCAAGC TCGCGGCCGG GCAGCGCAGT TCGATGGCGC TGACGCTCGA TACCGCGGGC TCCGCCGGCG 
CTGCCCGGTT CGACATCGAT ATCAAGGGAC CGAACGGCCT GAGGCTGGCG CGGCATTACG ACCTCGAGGT TAAGCCCGCG ACACAGATCC TGGCGCGTCG CTCTGTGCGA ACGCTGGCGA AGGGCGAGAG CCTGACGCTG ACCTCGGATA TGTTCTCGGA CCTCGTGGCG GGCACGGGAG GGGTGTCGAT GTCGGTCGGC CTGTCCTCGG CGCTGGACGC GGCGAGCGTC CTTAAGGCGC TCGACCGTTA TCCGTTCGGC TGCTCGGAGC AGATCGCAAG CCGGGCGATG CCGCTGCTCT ACGTCAATGA TCTCGCAGCG GAAGCCCACC TCGCGATGGA TACCAGTATC GACGATCGCA TCAAAAGCTC GATCGAGCGG CTGCTGGCAA GGCAGGGCTC GAATGGTTCC TTCGGCCTGT GGTCGTCCGG CGGCGAGGAT TCCTGGCTCG ACGCCTACGT CACGGACTTC CTGACGCGCG CTCGCGAGAA GGGCTTCTCC GTCCCCGACA TTCAGTTCAC GAGCGCGCTC GACCGCATCC GCAACTCGGT GGTGAACGCA GCCGAGCCGG AAAAGGACGG CGGTCGCGAC CTGGCTTACG GGCTTTACGT TCTCGCCAGG AACGGCGCCG CCCCGATCGG CGATTTGCGT TATCTGGCCG ATACCAAGTT GAACAATCTG GCGACGCCGA TCGCCAAGGC GCAACTTGCC GCGGCGCTCG CGCTGGTCGG CGACCGGGCG CGCGCTGAAC GTGTTTATAT TGCTGCGGCT GAAAGCCTTG CGCCGAAACC GGCACTCGAA TTCGGCCGGG CGGACTACGG CTCGGCGCTC CGCGATGCGG CAGCGCTGGT ATCGCTGGCG AGCGAGGGCA ACGCGCCGCG AGCCACGGTC ACCCAAGCGG TGCAGCGGGT GGAAGCGGCG CGGGGGCTGA CGTCCTCAAC CTCGACGCAG GAGAACGCGT GGCTGGTCCT GGCGGCGAGG GCGCTTGCGA AAGAATCAAT GTCGCTCGAT GTCAACGGCG CGCCCGTTAA AGCCGCGCTT TATCGCAGCC ACAAGGCGGC CGAGCTGGCC GACAAGCCGA TCAAGATCAC CAACACAGGC GAAGCGCCGG TGCAGGCGGT GATCTCCGTG GCCGGCGCGC CGGTTACTCC CGAACCGGCG ACCTCGAACG GCTTCGTAAT CGAGCGCAAC TATTTCAACC TCGACGGAAC GCCCGCGGAT CCGACCCAGG CCACGCAGAA CGACCGCTTG GCGGTCGTTC TCAGGATCAC TGAAACGAAG CCTGAATACG GTCATATCCT GGTGGCGGAC TATCTTCCGG CAGGATTCGA GATCGACAAT CCTAATCTGG TATCCTCCGG GGACACCGGG ACGCTCGACT GGATCGAGGA CGGCAAGGAG CCGGTCAGCA CTGAATTCCG TGACGACCGC TTCACCGCCG CCATCGATCG CACTGCGAGC GACAAGGCGG TATTCACCGT GGCCTATGTC GTGCGCGCGG TGTCGCCCGG AAAATACGTG CTGCCGCAGG CATACGTGGA AGACATGTAC AATCCCTCGC GCTACGGCCG CACCAGCACC GGCCGCGTCG AGGTGCGTTC AACAAAATGA
|
Protein sequence | MTRLVRAVAF CATLLIGLGS VAHAADKAFK RDDLADAAIK LEARIKSEAG AVAKSGSMLR TDADTAFKRA DFRSGLQILG QIATVAPNDS GNWLRLARTI FQIKPATSQE QTFLLERAST AAYLAYQRAT DPGTEADALA VLGRAFSDRK LWRPALDALR LSLDLREVAS VREQYEKMRD DHGFRLLDYT VDSDSATPRV CFQFSEDLAK RVDFSPFVAL AGDDRPALSS EDRQLCVEGL KHGERYNINL RAGLPSAVKE SLPKSAEFNV YVRDRKPFVR FTGRAYVLPR TGQRGIPLVS VNTQNVTVKV FRIGDRNLIN TVVESDFQKA LGSHQLYELG HERGIKVWSG EVTTASTLNA DVTTAFPVDE ALGNLQPGVY VMTAAPKGPG STDDDESGSL ATQWFIVSDL GLSAYSGNDG IHVFVNSLAT TDAVDKAEVR LVARNNEILA TRKTDASGHA LFEPGLARGE GGQSPALLTV STDKADYAFL SLKSNAFDLT DRGVSGRAVP SGADAFVYAE RGVYRSGETV YLTALLRDGQ GVAVTGGPLT LVVERPDGVE YRRAVLSDHG SGGRSLDLPL NSAVPTGTWR VRAFTDPKGP SVGETTFMVE DYVPDRIEFD LTTKAKQIAA DNPVELKVDG RFLYGAPASG LQLEGELLVA PAESRPGYAG YRFGVPDDEA ASNERTPIEN LPEADANGVA TFPVSLATAP SSDRQHEAQI FIRMAEAGGR AVERKIVLPV KPSAAMIGVK PLFADKSVAE GDRARFDVVF VAPDGTSLAR KGLRYELLKL ESRYQWYRQN SYWEYEPVKS TRRVADGDLS IAADSPARIE LSPQPGRYRL DVKSSDSDGP LTSVQFDVGW YSDGSADTPD LLETSIDKPD YQSGDTMVVS VNARTAGKLT INVLGDRLLT TQTTEVKEGT SQVKIPVGKD WGTGAYVVAT LRRPLDVAAQ RMPGRAIGIK WFGIDRSART LSVNLSPPEL ARPSAPLKLP VKVGGLSPGE DAKIVVAAVD VGILNLTNYK PPAPDDYYLG QRRLTSEIRD LYGQLIDGMQ GTRGQLRTGG DFAGAELQGS PPTQKPLALY SGIVTVAADG TAEISFDIPE FAGTARVMAV AWTATKLGRA TVDVTVRDPV VLTATLPRFL LTGDRGTMSF DLDNVEGPPG DYTVNVRTSG PVKVAGNATT AIKLAAGQRS SMALTLDTAG SAGAARFDID IKGPNGLRLA RHYDLEVKPA TQILARRSVR TLAKGESLTL TSDMFSDLVA GTGGVSMSVG LSSALDAASV LKALDRYPFG CSEQIASRAM PLLYVNDLAA EAHLAMDTSI DDRIKSSIER LLARQGSNGS FGLWSSGGED SWLDAYVTDF LTRAREKGFS VPDIQFTSAL DRIRNSVVNA AEPEKDGGRD LAYGLYVLAR NGAAPIGDLR YLADTKLNNL ATPIAKAQLA AALALVGDRA RAERVYIAAA ESLAPKPALE FGRADYGSAL RDAAALVSLA SEGNAPRATV TQAVQRVEAA RGLTSSTSTQ ENAWLVLAAR ALAKESMSLD VNGAPVKAAL YRSHKAAELA DKPIKITNTG EAPVQAVISV AGAPVTPEPA TSNGFVIERN YFNLDGTPAD PTQATQNDRL AVVLRITETK PEYGHILVAD YLPAGFEIDN PNLVSSGDTG TLDWIEDGKE PVSTEFRDDR FTAAIDRTAS DKAVFTVAYV VRAVSPGKYV LPQAYVEDMY NPSRYGRTST GRVEVRSTK