Gene Nwi_0049 details

Gene Information

Locus tag: Nwi_0049
Symbol:
ID: 3676488
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 56100
End bp: 59261
Gene Length: 3162 bp
Protein Length: 1053 aa
Translation table: 11
GC content: 69%
IMG OID: 637711584
Product: hypothetical protein
Protein accession: YP_316669
Protein GI: 75674248
COG category: [L] Replication, recombination and repair
COG ID: [COG3893] Inactivated superfamily I helicase
TIGRFAM ID: [TIGR02786] double-strand break repair protein AddB, alphaproteobacterial type
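
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, not stated in the record) and that the protein length excludes the terminal stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 56100
END_BP = 59261
GENE_LENGTH_BP = 3162
PROTEIN_LENGTH_AA = 1053


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count of an unspliced CDS, minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks agree with the record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 59261 - 56100 + 1 = 3162 bp, and 3162 / 3 - 1 = 1053 aa, which matches the listed gene and protein lengths.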


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.258638
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.374652
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGTCC GCAACGTCCC CTCCTCAGTG CCGTTTCTGC GCGCGGTCAT CACCGCGCTG 
GTCGACGGGG AGCTGATCGA GGGATTCCGT CCGCGCGCGC AGCCCGAACG GCTTGCCGAA
GCCACGCTCT ATCTGCCGAC CCGCCGCGCC GGCCGCATGG CGCGCGATAT TTTCCTCGAC
GTGCTGGATA CCGACGCCGT GATCCTGCCG CGCATCGTCG CGCTCGGCGG AATCGACGAG
GACGAACTGG CTTTCGCGCA GGCCGTCTCG TTGCCGCCGG AGACGCTCGA ACTGCCGCCG
GCGCTGGACG GGCTCGCGCG CCGCCTCGCG CTCGCCCAGC TTATCGACGC ATGGGCGCGG
CGGCTGAATC TCGGTGACGG CGAGCCAGCG CGGGCCCCGC TGGTGCTCGG CGGTCCGGCC
TCGACGCTGA CGCTTGCCGC CGATCTCGCG CGACTGATGG ACGACATGGC CACGCGCGGC
GTGGACTGGC GCGCGCTCGA CGGGCTGGTG CCCGAGGCGC TCGACCGCTA CTGGCAGATC
ACGCTCGATT TCCTGAAGAT CGCGCGCGAC TACTGGCCGG CTCACCTTCA GGAAACCGGC
CGGATCGAAC CCGCGGCGCG GCGCGACCGC CTGATCGAAG CCGAAGCCGC GCGTCTCGCC
GCCCATCATG TGGGACCGGT GATCGCGGCG GGCTCCACCG GCTCGATGCC GTCGACCGCG
AAACTGCTGC ACGCCATCGC CACGCTGCCG CAAGGCGCGG TCGTGCTGCC AGGACTCGAC
ACCGACCTCG ATGAAGAGGC GTGGAGGTTG ATCGGCGGCG TCAGGGATAA AGCGAGCGGC
GCGTTCATCA CGCCGCCCGC CGCCGGTCAT CCGCAGTTCG CGCTGCACGG CCTGCTCGCG
CGCTTCGGCA TCAAACGATC GGAGGTGAAA GCGCTCGGCT TCCCCGCGCC ATACGGGCGC
GAAATGCTGG CGTCCGAGGC GATGCGGCCC GCGGAGGCGA CCGCGCAATG GCATACCCGG
CTCGCCGAAC CCGAGGTCGC GGATAAGATC GCGGCCGGAC TGCAAAATCT CGCCGTGATC
GCGGCCGCCA ATCCGGAAAT GGAAGCGCTC GGCATCGCCG TCGCCATGCG CGAGGCGCGC
GACCTGAACA AGACCGCGGC GCTGGTGACG TTCGACCGCG CGCTGGCGCG GCGGGTGATG
GCGGCGCTCG GCCGCTGGAA TCTCGCCTTC GACGATTCGG GCGGCGATCC GCTGATGGAC
ACGCCTGCTG GAATCTTCGC ACGGCTGGCC GCCGAGACCG CGTGCAACGG TCTGGAGCCC
CCGACCCTGC TGGCCTTGCT GAAGCATCCG CTGTGCCGGT TGGGACGCGC GGCGGGCGGC
TGGTCGCGTG CCATCATAGC GCTGGAGCTT GCGATCCTGC GAGGCCCCCG GCCGCCGCCG
GGCGGCAAGG GCCTCGCGGA CGAATTCGCG CGCTTTTGCG ATGAGCGCGA CCAGCTCGAT
CGCGGCGAGA GTTCCTCGTT GCATCGCAGC GAGCCCCGCG CGGCGCTGAA ACCGGACCAT
CTCGACGATA TCAGCGCCCT GATCGCGGCC CTGCGGGATG CGCTCGCGCC GCTCGAAGGC
GACGGCGCAT CGAGATCGGC GGACTTCACC CTCCTCGCCC TGCAGCATCG CCAGACTATC
GAATCGTTAT CCTGTGACGA TGAAGGCGTC GCCGTCGCAT TCGACGGCCG GCACGGCCGT
GCCCTCGCTG ATGCTTTCGA CGATCTGGTT GATGCGGGTG AGCGCAGCGG CCTGACGGTC
AGGATCGCGG ACTATCCGGA AACCTTCGAG GCCGCGTTCG GCGATCGCGT GGTGCGCCGG
CCGCAGGCGA CATCGGCAAG CCTGCGCATC TACGGACCGC TCGAAGCGCG TCTGACCCAA
TGCGACCGCG TCATCCTCGG CGGGCTCACC GAAGGCGTCT GGCCGCCCGC GCCGCCGACC
GATCCCTGGC TCAGCCGGCC GATGCGGCAC GAACTGGGGC TCGATCTTCC GGAACGGCGC
ATCGGACTGT CCGCGCACGA TTTCACGCAA CTGTTCGGCG CCGAGGAGGT CATCCTCAGC
CATGCCGCCA AGGCCGCAGG CGCGCCGGCG GTGGCGTCCC GTTTCCTGCA TCGGCTCGAA
GCCGTCGCGG GCGCGACACG CTGGACATCG GCGAAACAAG CCGGCGCGCG ATATATTCAG
TATGCCGAAG CGCTCGATCG GCCCAGCGAG GTTACACCGA TCGCGCAGCC CGCGCCGAAG
CCGCCGCGCA CGGCGCGGCC GACGCGCCTG TCCGTGACCG CGATCGAGGA CTGGCTGCGC
GATCCCTACA CCATCTACGC CAGATACATC TTGAAGCTGC TGCCGCTGGA GGCCGTCGAC
ATGCCCTTGT CGGCGGCAGA CCGCGGCTCG GCGATTCACG ACGCGCTCGG CGACTTCACC
AAACGTTATC CGGCAAGCCT GCCCGACGAT CCGGAAGACG TGCTGCGCGC CATCGGCGAA
AGCCGTTTCG CGCCGCTGAT GCAGCGGCCG GAAGCGCGCG CGCTGTGGTG GCCGCGCTTC
CAGCGCATCG CGGCATGGTT CGCGGAATGG GAGCCGACGC GGCGGGCGCA TCTGGTCAGG
ATCGACGCCG AGGTCAGCGG AAAAATCGAA ATCCCGATCG ACGGCGATCG CAGGTTCACT
CTGTCGGCGC GCGCCGACCG TATCGAACAT CTCGGCGGCG GCCGCTTCGC CGTTCTCGAC
TACAAGACCG GCAGTCCTCC GAGCAGCAAG CAGGTGCGGC TCGGCCTGTC GCCGCAGCTC
ACGCTGGAAT CCGCGATCCT GCGCAACGGC GGCTTCGCGG GCATTCCCCC CGGCGCATCG
GTCAGCGAGC TGGTCTATGT CCGGCTCAGC GGCAACAATC CTGCCGGCGA ACCGAGACCG
GTCGATCTCG ATAACGGCAA AACCGCAACC CGGTCGCCCG ATCAGGCGGC CGATGTCGCG
CTTGAAGAAC TCACCGCGCT GATCCGTGCC TTCGACGACG AGCAGCAGGG CTACGCATCG
CTCGACCTGC CGATGTGGAA GGCCCGCTAC GGGGTCTATG ACGATCTCGC ACGGATCAAG
GAATGGTCGG CGGCGGGCGG GCCGGGGTCG GAGGAATGGT GA
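
The gene sequence is laid out in space-separated blocks of 10 bases, so the whitespace has to be stripped before any computation. The following is a minimal sketch of how the GC content field could be recomputed from such a block-formatted sequence. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first line of the sequence is used here for brevity; pasting the full sequence in its place would give the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, copied verbatim.
first_line = "ATGCGCGTCC GCAACGTCCC CTCCTCAGTG CCGTTTCTGC GCGCGGTCAT CACCGCGCTG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 1))
```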
 
Protein sequence
MRVRNVPSSV PFLRAVITAL VDGELIEGFR PRAQPERLAE ATLYLPTRRA GRMARDIFLD 
VLDTDAVILP RIVALGGIDE DELAFAQAVS LPPETLELPP ALDGLARRLA LAQLIDAWAR
RLNLGDGEPA RAPLVLGGPA STLTLAADLA RLMDDMATRG VDWRALDGLV PEALDRYWQI
TLDFLKIARD YWPAHLQETG RIEPAARRDR LIEAEAARLA AHHVGPVIAA GSTGSMPSTA
KLLHAIATLP QGAVVLPGLD TDLDEEAWRL IGGVRDKASG AFITPPAAGH PQFALHGLLA
RFGIKRSEVK ALGFPAPYGR EMLASEAMRP AEATAQWHTR LAEPEVADKI AAGLQNLAVI
AAANPEMEAL GIAVAMREAR DLNKTAALVT FDRALARRVM AALGRWNLAF DDSGGDPLMD
TPAGIFARLA AETACNGLEP PTLLALLKHP LCRLGRAAGG WSRAIIALEL AILRGPRPPP
GGKGLADEFA RFCDERDQLD RGESSSLHRS EPRAALKPDH LDDISALIAA LRDALAPLEG
DGASRSADFT LLALQHRQTI ESLSCDDEGV AVAFDGRHGR ALADAFDDLV DAGERSGLTV
RIADYPETFE AAFGDRVVRR PQATSASLRI YGPLEARLTQ CDRVILGGLT EGVWPPAPPT
DPWLSRPMRH ELGLDLPERR IGLSAHDFTQ LFGAEEVILS HAAKAAGAPA VASRFLHRLE
AVAGATRWTS AKQAGARYIQ YAEALDRPSE VTPIAQPAPK PPRTARPTRL SVTAIEDWLR
DPYTIYARYI LKLLPLEAVD MPLSAADRGS AIHDALGDFT KRYPASLPDD PEDVLRAIGE
SRFAPLMQRP EARALWWPRF QRIAAWFAEW EPTRRAHLVR IDAEVSGKIE IPIDGDRRFT
LSARADRIEH LGGGRFAVLD YKTGSPPSSK QVRLGLSPQL TLESAILRNG GFAGIPPGAS
VSELVYVRLS GNNPAGEPRP VDLDNGKTAT RSPDQAADVA LEELTALIRA FDDEQQGYAS
LDLPMWKARY GVYDDLARIK EWSAAGGPGS EEW
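
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds a codon table from the standard NCBI ordering of bases (T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; because this gene begins with ATG, that difference is ignored here, which is a simplification. The function name `translate` is hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in NCBI order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence; each full line holds 60 bases,
# so every line starts in reading frame.
first_line = "ATGCGCGTCC GCAACGTCCC CTCCTCAGTG CCGTTTCTGC GCGCGGTCAT CACCGCGCTG"
last_line = "GAATGGTCGG CGGCGGGCGG GCCGGGGTCG GAGGAATGGT GA"
print(translate(first_line))  # MRVRNVPSSVPFLRAVITAL
print(translate(last_line))   # EWSAAGGPGSEEW
```

The two outputs match the first 20 and the last 13 residues of the protein sequence listed above, and the final codon TGA is a stop.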