Gene Nwi_1590 details


Gene Information

Locus tag: Nwi_1590
Symbol:
ID: 3676756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 1737934
End bp: 1739244
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 61%
IMG OID: 637713146
Product: hemolysins and related proteins
Protein accession: YP_318203
Protein GI: 75675782
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
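
The coordinates and lengths listed above agree with each other if the start and end positions are 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic, using only values from the record:

```python
# Values copied from the Gene Information fields.
start_bp = 1737934
end_bp = 1739244

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the final codon is the stop codon
# and does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, codon_count, protein_length_aa)
```

Under that assumption the result matches the listed Gene Length (1311 bp) and Protein Length (436 aa).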


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.727705
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGTGATG ACAGTTTAAC CAGTCTGGCC GGAGTTCTCT CTGTCATCAT TCTTGTCATC 
GCGAACGGCT TTTTCGTCGC TTCAGAATTC TCACTGGTGG CGATGCGTCG CAGCCGCGTC
GCGCAACTGG TTGCCGCCGG AGGCGCAAAC GCCCGTGCGC TACAGAAGGC GGTTGAACAT
CTCGATTATA ATCTCGCGGC GTGCCAGCTC GGAATTACGC TGTCGTCATT GGCTCTGGGC
TGGATCGGAG AACCGGCGCT GGCGCACCTG ATCGAGCCGC TGCTGGTTCT GCCATTCGGT
ACCGGCGCCG AAATCGGCGC TCACGCCATC GCCATCATTA TCGCCTTTGC CATCATTACG
ATCCTGCACA TCGTACTTGG CGAACTCGCG CCGAAGAGCC TCGCCTTGCA GCGGACCGAG
ACCACCGCGT TGATCATCGT GCGTCCGCTC GCGCTGTTCG GGTTGCTGTT GAAACCGGCG
ATCGTGTCAT TGAACGGGCT CGGCAACGGC GTCTTGCGTC TGTTCGGCCT CAGCGCCGCC
AGCGAAAAGG AAACGCTTCA CTCACCTGAC GAGATCAGGC TTCTCGTTGC CGAGACCGAG
CGAGCGGGCT TGCTTGCTCG AACCCAGCGG GAAGTCGTTG AGCGGGTCAT CAGCCTCACC
CGTCGCGACG TGAGCGACAT CATGACCCCG CGGATGAGCA TCGTTTGGGC CGACGCGGAT
GACAGCCGGG ACGAGATTCT CAATGTAGTG CGTGAGTGCA AGCATGAGCA CGTCGTTATC
GGGCAGGGTA GCATCGACGA GGTCGTGGGT GTACTGCGCA AGCAAGACCT GCTCGATCAG
GCGCTTGAGG GACGTGAGGT CGATCCGCTC GCGGTGTTGC AGCAACCGCT CGCACTCTCG
GAGTTCACGC CGATCCTGCA AGCCATCGAG CGTTTCAAGG CGCAACCTGT TCGCGTCGGC
ACCGTGGTCG ACGAGTACGG AACGTTGCAG GGGATCGTCA CGCGCACCGA TCTGCTCGAG
TCCATCGCAG GTGAACTGCC GGATGCCGGT GAGGAGCCGG ACATACTGGG AGGAGGCGAC
GGCAAGTTTG TCATCAACGG ACGAATGCCG ATCGACGAAG CGATGATCCA CCTGGGCATC
AGCCGCAAGC CCGATGGAGA TTTCCATACC GTGGCGGGCT GCGCGATCGA GTTGCTGGGT
CGAATACCGG CGGTCGGCGA CGAGTTCGCC TGGGAAGGGT GGCTGTTCCG GATCTCGGAA
ATGGACGGTC CGCGGATCAG CAGGCTAATC GCGTCGAGGT CCGTCGAATG A
 
Protein sequence
MGDDSLTSLA GVLSVIILVI ANGFFVASEF SLVAMRRSRV AQLVAAGGAN ARALQKAVEH 
LDYNLAACQL GITLSSLALG WIGEPALAHL IEPLLVLPFG TGAEIGAHAI AIIIAFAIIT
ILHIVLGELA PKSLALQRTE TTALIIVRPL ALFGLLLKPA IVSLNGLGNG VLRLFGLSAA
SEKETLHSPD EIRLLVAETE RAGLLARTQR EVVERVISLT RRDVSDIMTP RMSIVWADAD
DSRDEILNVV RECKHEHVVI GQGSIDEVVG VLRKQDLLDQ ALEGREVDPL AVLQQPLALS
EFTPILQAIE RFKAQPVRVG TVVDEYGTLQ GIVTRTDLLE SIAGELPDAG EEPDILGGGD
GKFVINGRMP IDEAMIHLGI SRKPDGDFHT VAGCAIELLG RIPAVGDEFA WEGWLFRISE
MDGPRISRLI ASRSVE
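
The gene sequence starts with GTG while the protein starts with M. That is consistent with translation table 11, where GTG is one of the permitted start codons and an initiating codon is read as methionine. The sketch below is a hand-rolled illustration, not the tool that produced this record. The names `translate`, `CODON_TABLE` and `START_CODONS` are hypothetical helpers, and the check is run only on the first and last stretches of the gene sequence shown above.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Table 11 uses the same assignments as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna, first_is_start=True):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored so that sequence blocks can be pasted as displayed.
    If first_is_start is True and the first codon is a table 11 start codon,
    it is emitted as 'M'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, copied from the record.
head = "GTGGGTGATG ACAGTTTAAC CAGTCTGGCC GGAGTTCTCT CTGTCATCAT TCTTGTCATC"
# Last 21 bases of the gene sequence, ending in the TGA stop codon.
tail = "GCGTCGAGGTCCGTCGAATGA"

print(translate(head))                        # MGDDSLTSLAGVLSVIILVI
print(translate(tail, first_is_start=False))  # ASRSVE
```

Both fragments reproduce the corresponding ends of the protein sequence listed above.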