Gene Nwi_3004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_3004 
Symbol 
ID3675031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp3250656 
End bp3253904 
Gene Length3249 bp 
Protein Length1082 aa 
Translation table11 
GC content65% 
IMG OID637714570 
ProductTPR repeat-containing protein 
Protein accessionYP_319606 
Protein GI75677185 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGTAT CCGTTAGCAG CACGATCGAG GAGATGCAGG CGGAGCGTAA CGCGCTTCAA 
AGCTTTGCCG ACGATCGCGC CCGCGACTTC GTCGGCAGGC AGTCGGTGCT TGCGCGTGTC
ACCGACCTCT GTCTCTCCCC CGCCAAGGAG CTTTCTCCCA CGAAGGAGGG CCCGCCATGG
GGGATCTGCA TCACCGGCGA TCCGGGTTCG GGCAAGAGCG CCCTGTTCGG CGAACTTCTC
CGCCGCCTGA AGGAAACCGA TGCCTTCATC CTCGCGCATG CCGCGGAAGC GACCCCGCAG
GCCACGTCGA TCGATGCCAT GCTGCGCCGC TGGATTACCG AGCTTGGCAG CGCCCTCGGC
GCCGGCGATA TCGACCTCGC CGCCAATATC GATCCGGAGA TTGTCGAACG GGCCTTCGCC
TCGCTGCTGG TGCGGATGGC TTCGAAACGG CGCGTGGTGG TCCTGATCGA CGCGATCGAC
CAGTTCAGGA AAACCTCTCG CGGGAAGTTT ACGAGCTGGC TGCCGCGGCT GTGGCCGATC
AATGCGCGCC TGGTCGCGAC CGGCGTTGCC GGAGGCGCCT CCAAGGCGCT GTCCGAGCGC
TCCAGTGTGG AGGCGTTGGC TCTGCCGCCG CTGGAGTCGT CCGAGGCGGA GGGCATCATC
GAAGCGGTCT GCAAGCGCAA CCAGCGCAAG CTGGCTTCAC CGGTGATCGA TGCACTGCTC
GCGAAGAAGC ATGCGGGCGG GCCGGCCTGG GGCAATCCGT TATGGCTGGT GCTGGCGCTC
GAGGAACTCG AACTGCTCGA CAGCAGTGAT TTCGCCGACA TGCAGCGTGA GTATGCCGGC
TCTCCGGAGC AGCGCCTGCA GAGTGCGATG CTGGATGCGG TCGGCGCGAT GCCGACCGAC
ATTCCGCTTC TCTGCCGCGC GACGTTCGAT AGCGCGGCCC AGCTGTTGAG CCCCGCGATC
GCGGCGGCGT TCATCGGCCT CCTTTCCGCG GGCCGGACAG GATGGCGAGA GAGTGACTTC
CGTCAATTGC TGCCGCAGGT CGGTGGCGAG GCATGGGACC AGAAGCGCTT TGCTACGCTG
CGCAACCTGT TCCGCGGTCA GGTTCATCAG CATGGCGACC TTGCGCAACT GGATTTCAAC
CATTCCCAGG CGCGCGCCGC CGCCCGTTCG CGCCTTGCCG CGTTGCGCGT TACGAATCCC
GAACTGCACC TGCTGATCGC GGATCACCTT CTGACGCTTG CACCGGACGA CCGGCTGCGG
GTGACTGAAA CCATGGTCCA TCTTCTGGCA AGCAAGGACG ACGTCCGCGC GGCCCGGCAC
TATGGCGATC CATCGCTCGG CGAAAAGGAG CTGGGAGCCG CCACGCTGGC GCTCGCCGAT
GCGGTCATTT CGCCGGCGAC CGGCACTCCG GCGAGCGCGG CGCGCGAGAT GTGCCGTTTG
CTTGAGAACC CGGATCACTC CGTCCGCGCC CGCGTGGCCG AGCGGCTGCT CGTCAGTTTT
GACGATGTGG CGGGCCGGCA CATTGCTTCC GATGTCCGCC TGATCGTTTT GAACGCGATC
GAACAAGCGT TCCGGCAACT GCTGCGCGAC GAACCCGGCA ATGCCGGCTG GCAGCGCAAT
CTGTCGGTTG CCCACGATCG CGTCGGCGAC GTGCTGGTGG CGCAGGGCAA GCTGCCTGAG
GCGCTTGCGT CGTTCCGTGA GGGACTTGCG ATCCGGGAGA AATTGACGGG CACGGACCCG
GACAATGCCG GGCTCCAGCA CGATCTATCG ATCTCGTACG AGAAGATCGG CGATCTGCTT
GCGGTGCAAG GCAAGCTCGA CGAGGCGCTG GACTCGATCC GCAAGCAGCT CGCGGTTGTC
GAGCGGCTGG CGAACAGCGA TCCCGACAAC GCAGACCTGC AGAGCGAGAT CTCGCTGTGC
CATGAAAAGA TCGGCGAAGC GCTGATGGCG CGAGGCGATC TTCCGGGAGC GCTGGGCGCG
TTCCAGAACC AGCTCGAGAT CAGCGAGCGG CTGGCGCGCG GCGCGGACAC CGACGACAGC
AAATGGCAGC GCGAGCGAAC GCTCGCTTAC GATCGCGTCG GCGACGTGCT GATGGCGCAA
GGCAAGCTGC CGGAGGCGCT CGAATTTTTC CGCAACGGAC TGACGATCAA GGAGCGGGTC
GCAAAGGCTC ACCCCGAGAA CACGAGCTGG CAACGCAGCC TGTCGATCTC CCATGATCGC
ATCGGCGACG CGCTGGTGAC GCAGGGCAAA CTGCCTGACG CGTTGACCTC TTATCGCACC
GGACTTGAGA TCGCCCAGAA ACTGGCGCGC ACCGAGCCCG AGAATGTCGG CTGGCAGCGC
GACCTGTCGG TGTCGCACGC CAAGATCGGC GATGTGCTGG TGGCGCAGAG CAGTCTGCCC
GACGCATTGA AAGCCTTCCA TGAAGGTTTG ACGATCAGGC AGCGCCTGGC GAACGCCGAT
CCCGACAATG TCGATTGGCA GCGCGGTCTG GCGGTGTCCT ACGATCGCAT CGGCGACGTG
CTGATGGCGC AGGACAACAT GGGCGATGCG CTGAGCGCCT TCCAGGATCA GTTCGCGATC
GCCGAGAAGC TGGCGCGGAT CGATCCGAAC AACACCGGCT GGCAGCGCGA TCTGTCCGTA
TCCTACGAGA AAGTCGGCGA CGTCCAGATG GCGCAGGGCA ATCTGCCCGA CGCGTTGAAA
TCCTATCGCG GGGGGCTCGC GATCAGAGAG CGGCTGGCGC GCACCGAGCC CGATAACATC
AACTGGCAGC GCAGCCTGTC GGTGTCGCAC GATTGCATCG GCGACGTGCT GGAAGCTCAG
GGCGAGGTGG ACGAGGCGTT GAAAGCCTTC CACGAAGGGC TTGAGATCAG GAAACGGCTA
GCGCAGCGTG ATCCGTCCAA CGTCGGATGG CAGCGCGACC TGACCGTATC TTACGATCGT
CTCGGCGAGG TGTTGGAATC GAAGGGCGAC CTGGAAGCGG CCGAAAAGTC CTTCCGCGAA
GGGCTGGCGA TCATGGAGCG GCTGTCGCTT GCCGACCCCG GCAACGTCGA TCTGCAGCGC
GGCGTGGCCG TGAGCCAGGG ACACCTGGCC GAAATGTACC GCCGGTCCAA CGATCACAAC
AGCGCGCTGG CCGCATTGCG GCAGGGACAG GCCGCCATGG AGCGCGTCGT CATGCGCGCA
CCTGAGAATG CCGGCTGGAA AAAAGATCTG GACTGGTTCA CCGAACAGAT CTCGACCTTG
ACGGATTAG
 
Protein sequence
MSVSVSSTIE EMQAERNALQ SFADDRARDF VGRQSVLARV TDLCLSPAKE LSPTKEGPPW 
GICITGDPGS GKSALFGELL RRLKETDAFI LAHAAEATPQ ATSIDAMLRR WITELGSALG
AGDIDLAANI DPEIVERAFA SLLVRMASKR RVVVLIDAID QFRKTSRGKF TSWLPRLWPI
NARLVATGVA GGASKALSER SSVEALALPP LESSEAEGII EAVCKRNQRK LASPVIDALL
AKKHAGGPAW GNPLWLVLAL EELELLDSSD FADMQREYAG SPEQRLQSAM LDAVGAMPTD
IPLLCRATFD SAAQLLSPAI AAAFIGLLSA GRTGWRESDF RQLLPQVGGE AWDQKRFATL
RNLFRGQVHQ HGDLAQLDFN HSQARAAARS RLAALRVTNP ELHLLIADHL LTLAPDDRLR
VTETMVHLLA SKDDVRAARH YGDPSLGEKE LGAATLALAD AVISPATGTP ASAAREMCRL
LENPDHSVRA RVAERLLVSF DDVAGRHIAS DVRLIVLNAI EQAFRQLLRD EPGNAGWQRN
LSVAHDRVGD VLVAQGKLPE ALASFREGLA IREKLTGTDP DNAGLQHDLS ISYEKIGDLL
AVQGKLDEAL DSIRKQLAVV ERLANSDPDN ADLQSEISLC HEKIGEALMA RGDLPGALGA
FQNQLEISER LARGADTDDS KWQRERTLAY DRVGDVLMAQ GKLPEALEFF RNGLTIKERV
AKAHPENTSW QRSLSISHDR IGDALVTQGK LPDALTSYRT GLEIAQKLAR TEPENVGWQR
DLSVSHAKIG DVLVAQSSLP DALKAFHEGL TIRQRLANAD PDNVDWQRGL AVSYDRIGDV
LMAQDNMGDA LSAFQDQFAI AEKLARIDPN NTGWQRDLSV SYEKVGDVQM AQGNLPDALK
SYRGGLAIRE RLARTEPDNI NWQRSLSVSH DCIGDVLEAQ GEVDEALKAF HEGLEIRKRL
AQRDPSNVGW QRDLTVSYDR LGEVLESKGD LEAAEKSFRE GLAIMERLSL ADPGNVDLQR
GVAVSQGHLA EMYRRSNDHN SALAALRQGQ AAMERVVMRA PENAGWKKDL DWFTEQISTL
TD