Gene Nwi_1990 details


Gene Information

Locus tag: Nwi_1990
Symbol:
ID: 3674254
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 2176161
End bp: 2179274
Gene Length: 3114 bp
Protein Length: 1037 aa
Translation table: 11
GC content: 69%
IMG OID: 637713554
Product: hypothetical protein
Protein accession: YP_318601
Protein GI: 75676180
COG category: [S] Function unknown
COG ID: [COG3002] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
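
The coordinate and length fields in this record are mutually consistent, which the short sketch below checks. It assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 2176161
end_bp = 2179274

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "3114 1037"
```

Both results match the Gene Length (3114 bp) and Protein Length (1037 aa) reported in the record.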


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGATG CATTGAACCT CGGCCGGCGT CTACGGGTGC GCTCGACCGC CTATGTGGCG 
GGTGAGCCGG TGCCGTTCTT CTGGCCGATG CGGACCTTCA TTCATCACAA CCCGCTCTAC
GGCCTCGAGG ACATGCCGTT CGAGCAGGCG GTGCGTCGCG GCGCGGAGCT GTTTCACGCC
CGCATGTTCC TGCCGCGCAG CGACTATCAG CGCTGGCAGC GCGAAGGCAA GGTGCGGCCG
GAGACCCTCA AGGAGGAGAT CGGGCGACGC TCGCAGGACC TGCCTCCGGT GCCGGGCGTC
GACTGGCCGC GCTGGCTGCA TGCGCTGATG CAGACGCCGC ACGACCGCGA TGCGGTGGTG
CGCGGCGTGC GGGCGAAGGA CGTCCACGCC GCGCTGCACG GCCATCCGCC TTCCGCGCAG
GCCGTCGACG TCGCCGCGCT GCTGCCGGAG CTTGAGCAGC GCCTGCACGC GCGCACGCTG
CCCGAGGCGG TCGATGCCCT GTGGGGCACG GGACTCGCCG ACGAGCTCGA CGAACTCGTC
ATCAAGAACT GCCTTGATTT CTTCGACGAA GACCAGTCCG CCTGGCGGAT GCCGGGGCGC
GAGCGCGGCC TGTTCTCCGC CTGGAGCGAG GTCACGCGGC GCAACGCGCG CATGGTCCTG
CGCCGGCTGC ATGTGCGGCA CATCCTCGAC CATGTGCAGG ACGCCGAGAG CGCGGTGGTG
TACGTCATGG AGGAGATGGG CATCGGCGCC GAGGCCTGGC CGACCTACTT CACCCGCGTG
CTCACCCGGC TGCACGGCTG GACCGGCTTC GTGCGCTGGC GGGCGTCGGC CAAACATTAC
TATTGGGCGC AGCAGTACCC GGCCGATATC GTCGATCTGC TGGCGATCCG GCTGGTCATG
GGGCTGGCGC TGCTGCAGGA AAGCGCGCGC CATCGCGGCA CGCCGATGCG GCGTGACGAT
CTTGGCGCCG TGGTGCGCGA ACGCGGCGCG GAATGCGTGC TGCGGTACGC GCTGCACAGC
GGCGAGGTGC TGCCCGACTG GGCGCAGCGC ATCGACGACA CGCTGTCGCG CGGCAACGGC
GCCCGCTGCC ATGATCTCCT GCAACGCTAC TGGCCGCTGT GGCAGGCCCG CCTCGGGCAG
CAACAGGCCG CCGCGCTGCG CGAGCTGGCC GCCGCGGCGA ACGCCACCGC CGCGCTCGAT
GCGCTGGCGC CGGAGGACGT CGAGGGCCTG TTGCAGGGGC TGCGGGACTT CGCGCCGCAG
GAAGGCATGG TGTGGACCCT GGCGATGGAG GGGCAGGCCA TCGACAAGCT GCTCACCCAG
GTGCAGGCGC CGCAGGATCC GCCGCCCGAC AAGATACCGT TCGCGCAGGC CTGGTTCTGC
ATCGACGTGC GCGCCGAACC CATCCGCCGC CACCTGGAGC GCGTGGGCAA TTACCAGACC
TTCGGCATCG CCGGCTTCTT CGGCGTGCCG GTGGGCTTCC TCGGCTACGG CAAGGGCAGC
GAAAGCCACT ACTGCCCGGC GGTGATCACG CCCAAGAACC TGGTGCTGGA GCTGCCCGCC
GCGCTGGACC CCCACAACGA GGATTTCGTC AGCACGCTCG GCCACGTGCT GCACGATCTC
AAGAAGTCGG TGCTCTCGCC CTATGTCACG GTGGAAGCGG TGGGCATGCT GTTCGGGCTG
GACCTGTTCG GCAAAACCCT GGCGCCCCTG GGCTACAGCC GCTGGCGCCG CCGCATCGAC
ACCGAAAAGC CGGTGACCCG CCTGCTGGTC GACAAGCTCA GCCGCGAGCA GGCCGACTCC
ATCATCCGCA CCCTGCAGCG GGCGATGATC GTCAAGGCGC TGCACACCGA ACTGAAGATC
GAGCGCGAGC GGGTGGATGA CGACATGATC CGCGAACTGC GCGAGATCGC GCTGCGCCGC
CGCGACGGGC CGACGCGGCT GCGCACGACG TTCGGCGTGC CCCAGACGCA GGAGGCCGAG
TTCATCGACA AGCTGCGTCA GGTCTACGGC GTCGATGCCG ACTACACCAA TCATCAGCTC
GAGCGGCTGG GGCGCATCGG CTACTCGCTC GACGAGCAGG TCAACTACGT GCACACGGCC
CTGACCATGA TCGGGCTGAC CCAGACCTTC TCGCGCTTCG TGCTGGTGGT CGGCCACGGC
GGCAAGACCG AGAACAATCC TTACGAGTCA GCTCTGGACT GCGGCGCCTG CGGCGGCGCC
AGCGGCATCG TGAACGCCCG GGTGTTCGCG CAGATGGCCA ACAAGCCCGC GGTGCGCGAG
CGGCTGGCGG CCATGGGCAT CACCATTCCG GAAGACACCT GGTTCATGCC CGCGCTGCAC
GTCACCACCA CCGACGCCAT CGAACTGTTT GACCTCGACC TGCTGCCGCC ACGCCACCTG
GTATATCTGG AACGCCTGCG GGACGGCCTG CGAGCGGCCT CGCGCCTGAC CGCGGCCGAG
CGCATGCCGA AGCTGTTGCC GGAGGCGAAG GCGCTCGAGC CGGCGGAAGC CTTGCGCCTG
GCGAACCGCC TGGCGGTGGA CTGGGCGCAG GTCCGCCCCG AATGGGGACT GTCGGGGAAC
GTCTACGGCA TCGTCGGCCG CCGCGCGCTC ACCGAGAACT CGGATCTGCA GGGCTCCGCC
TTCCTGCTGT CCTACGACTG GCGCTGCGAT CCCAGGGGCC GCCTGCTGGA GAATCTGCTG
GCAGCCCCGG TGGTGGTGGG CCAGTGGATC AACCTCGAAC ACTTCTTCTC CACCGTGGAC
AATGCCCATC TGGGCAGCGG CAGCAAGGTC TACCACAACG TGTCCGGGCG CTTCGGGGTG
ATGACCGGCA GCCTGAGCGA TCTGCGCACC GGCCTGCCGA TGCAGACGGT GATGCGCGAG
GGACGCCCTT ACCACGAACC GATGCGCCTG ATCGCGCTGA TCGAGGCGCC GCTGGACTTC
GCCGGCCGCG TGCTGGAGCG CGTGGTCAAG GTCAAAAGCC TGGTGCTCGG CGGCTGGATC
CGCGCCATCG TCATCGACCC CACCCAGGGC TACAAGCCCT TCGTCTTCAA CAACGGCCAG
TGGGAGGAGC GGACCCCTCT AATCGCTCCT GCCGAGAAGG AATACTCCGC ATGA
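
The gene sequence above is printed in blocks of ten bases separated by spaces, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage. The function name `gc_content` is hypothetical, and only the first line of the sequence is used as sample input.

```python
def gc_content(seq_text: str) -> float:
    """Return the GC percentage of a nucleotide sequence, ignoring whitespace."""
    # Remove the spaces and line breaks used for display grouping.
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGGTGGATG CATTGAACCT CGGCCGGCGT CTACGGGTGC GCTCGACCGC CTATGTGGCG"
print(round(gc_content(first_line), 1))
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content field of the record (69%).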
 
Protein sequence
MVDALNLGRR LRVRSTAYVA GEPVPFFWPM RTFIHHNPLY GLEDMPFEQA VRRGAELFHA 
RMFLPRSDYQ RWQREGKVRP ETLKEEIGRR SQDLPPVPGV DWPRWLHALM QTPHDRDAVV
RGVRAKDVHA ALHGHPPSAQ AVDVAALLPE LEQRLHARTL PEAVDALWGT GLADELDELV
IKNCLDFFDE DQSAWRMPGR ERGLFSAWSE VTRRNARMVL RRLHVRHILD HVQDAESAVV
YVMEEMGIGA EAWPTYFTRV LTRLHGWTGF VRWRASAKHY YWAQQYPADI VDLLAIRLVM
GLALLQESAR HRGTPMRRDD LGAVVRERGA ECVLRYALHS GEVLPDWAQR IDDTLSRGNG
ARCHDLLQRY WPLWQARLGQ QQAAALRELA AAANATAALD ALAPEDVEGL LQGLRDFAPQ
EGMVWTLAME GQAIDKLLTQ VQAPQDPPPD KIPFAQAWFC IDVRAEPIRR HLERVGNYQT
FGIAGFFGVP VGFLGYGKGS ESHYCPAVIT PKNLVLELPA ALDPHNEDFV STLGHVLHDL
KKSVLSPYVT VEAVGMLFGL DLFGKTLAPL GYSRWRRRID TEKPVTRLLV DKLSREQADS
IIRTLQRAMI VKALHTELKI ERERVDDDMI RELREIALRR RDGPTRLRTT FGVPQTQEAE
FIDKLRQVYG VDADYTNHQL ERLGRIGYSL DEQVNYVHTA LTMIGLTQTF SRFVLVVGHG
GKTENNPYES ALDCGACGGA SGIVNARVFA QMANKPAVRE RLAAMGITIP EDTWFMPALH
VTTTDAIELF DLDLLPPRHL VYLERLRDGL RAASRLTAAE RMPKLLPEAK ALEPAEALRL
ANRLAVDWAQ VRPEWGLSGN VYGIVGRRAL TENSDLQGSA FLLSYDWRCD PRGRLLENLL
AAPVVVGQWI NLEHFFSTVD NAHLGSGSKV YHNVSGRFGV MTGSLSDLRT GLPMQTVMRE
GRPYHEPMRL IALIEAPLDF AGRVLERVVK VKSLVLGGWI RAIVIDPTQG YKPFVFNNGQ
WEERTPLIAP AEKEYSA