Gene Nwi_0470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_0470 
Symbol 
ID3676502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp526966 
End bp528039 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content70% 
IMG OID637712012 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_317089 
Protein GI75674668 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.859268 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGTGC TAGGGATCGA GACCACCTGC GACGAGACCG CCGCCGCCGT GGTCGAGCGT 
CTCCCGGACG GCAGCGCCCG GATTCTCTCC AACATCGTGC GTTCGCAGAC CGAAGAACAT
GCCCCCTATG GCGGCGTGGT TCCGGAAATC GCGGCGCGCG CCCATGTCGA ACTGCTCGAC
GGCCTCATTG CCCGTGCGAT GACGGAATCC GGCGTCGGTT TCCGGCAGTT GTCGGGGGTC
GCCGCCGCCG CCGGTCCCGG CCTGATCGGC GGCGTCATCG TGGGGCTGAC CACGGCCAAG
GCGATCGCGC TTGTTCATGG CACGCCGCTA ACTGCGGTCA ACCATCTCGA AGCCCACGCG
CTGACGCCGC GGCTGACCAG CCGGCTCGAA TTCCCCTACT GCCTGTTCCT CGCTTCCGGC
GGGCATACCC AGATCGTCGC GGTGCTCGGC GTCGGCAACT ATGTCCGGCT CGGCACCACC
GTGGACGACG CCATGGGCGA GGCTTTCGAC AAGGTCGCGA AGATGCTCGG CCTGCCCTAC
CCCGGCGGCC CGGAGGTCGA GCGCGCCGCC GCCAGCGGCG ATGCGACGCG ATTCAATTTT
CCTCGGCCGA TGCTCGGGCG TCCCGACGCC AACTTCTCGC TCTCTGGGCT GAAGACCGCC
GTGCGCAACG AGGCTGCCCG CATCGATCCG CTGGAGCCGC GGGACATCAG CGATCTCTGC
GCCGGATTTC AGGCGGCCGT GCTCGAAGCC ACCGCGGACC GGCTCGGCGT CGGCCTCAGG
CTTTTTGAAG AGCGGTTCGG GAGGCCGCGC GCGCTGGTCG CCGCCGGCGG CGTCGCCGCC
AATCAGGCGA TCCGCGCCTC GCTGGAGGGC GTCGCCGCCA AGGCACGGAC CTCCCTCATC
ATCCCGCCGC CGGCGCTTTG CACCGACAAT GGCGCGATGA TCGCATGGGC CGGCGCCGAA
CGGCTTGCCG CGGGATTGAC GGACTCGCTT GAAACGCCGC CGCGCGCCCG CTGGCTGCTG
GATGCCAATG CGCAGGCGCC GGCAGGCTTC GCCAACACCC GCGCCGGATT TTGA
 
Protein sequence
MLVLGIETTC DETAAAVVER LPDGSARILS NIVRSQTEEH APYGGVVPEI AARAHVELLD 
GLIARAMTES GVGFRQLSGV AAAAGPGLIG GVIVGLTTAK AIALVHGTPL TAVNHLEAHA
LTPRLTSRLE FPYCLFLASG GHTQIVAVLG VGNYVRLGTT VDDAMGEAFD KVAKMLGLPY
PGGPEVERAA ASGDATRFNF PRPMLGRPDA NFSLSGLKTA VRNEAARIDP LEPRDISDLC
AGFQAAVLEA TADRLGVGLR LFEERFGRPR ALVAAGGVAA NQAIRASLEG VAAKARTSLI
IPPPALCTDN GAMIAWAGAE RLAAGLTDSL ETPPRARWLL DANAQAPAGF ANTRAGF