Gene Nwi_1938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_1938 
Symbol 
ID3674782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp2125727 
End bp2126677 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content60% 
IMG OID637713503 
ProductCRISPR-associated Csh2 family protein 
Protein accessionYP_318550 
Protein GI75676129 
COG category[L] Replication, recombination and repair 
COG ID[COG3649] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR01595] CRISPR-associated protein, CT1132 family
[TIGR02589] CRISPR-associated protein, Csd2 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.364907 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATGC TCACGAACCG CTACGATTTC GTCCTGCTCT TCGACGTAAC CAAAGGCAAT 
CCGAATGGCG ATCCTGACGC CGGAAACCTG CCGAGGCTCG ATCCCGAAAC CAACCACGGG
CTGGTGTCGG ACGTCAGCCT GAAACGCAAG GTGCGTAACT ACGTCGATCT CGTCCGATCC
GGAACCGATG GCCACCACAT CTATGTCGAG GAGGCCGCGA TCCTCAACGA CAAGCACCGC
CAAGCCTACA AGGCACTACG GCCGGACGAT CCGAAGGTTG ATAAAGAAGC CAAGCTCAAC
CCGAGGGATG ATGTCGAGGC GAAGAAGCTG CGGGAGTTCA TGTGCAAGAA CTTCTTCGAC
GTGCGTACCT TTGGCGCAGT CATGTCGACC GGCATCAATG CCGGTCAGGT CAGAGGCCCC
GTGCAGATGA CCTTCGCCAA CTCGGTCGAG CCCATCGTGC CGCAGGAAAT TTCGATTACC
CGCATGGCCG CCACCAACGA GGCGGAGAAG AAGCAACGGG CCGAAGGCGG TGAGGAAGGC
AACGACCGGG TCGACAATAG GACGATGGGA CGCAAGTACA TTGTGCCCTA TGGGCTCTAC
CGCGCGCATG GTTTCGTCTC GGCTAAGCTC GCCGAGCGCA CCGGTTTCTC CGAGGCCGAC
CTCGAACTGA CGTTCGAGGC GCTGACCAGT ATGTTCGAGC ACGATCGCTC CGCCGCGCGC
GGCGAGATGA CGACGCGCAA GCTCGTCGTC TTCAAGCACG GCAACGCGCT TGGCAGTGCG
CCAGCGCATG CGCTGTTCGA GCGCGTCAGA ATCGGCCGCA ACATCGACGG GCAGTTCCGA
AGGATCGACC GGTCGGACAA CTATCCGCCG GCACGTGCCT TTTCGGATTA CGCCGTGGAG
ATCGACCGCG ACAATCCGCC AGACGGCGTG GAAATCATTG AAAGGATCTG A
 
Protein sequence
MTMLTNRYDF VLLFDVTKGN PNGDPDAGNL PRLDPETNHG LVSDVSLKRK VRNYVDLVRS 
GTDGHHIYVE EAAILNDKHR QAYKALRPDD PKVDKEAKLN PRDDVEAKKL REFMCKNFFD
VRTFGAVMST GINAGQVRGP VQMTFANSVE PIVPQEISIT RMAATNEAEK KQRAEGGEEG
NDRVDNRTMG RKYIVPYGLY RAHGFVSAKL AERTGFSEAD LELTFEALTS MFEHDRSAAR
GEMTTRKLVV FKHGNALGSA PAHALFERVR IGRNIDGQFR RIDRSDNYPP ARAFSDYAVE
IDRDNPPDGV EIIERI