Gene Nwi_1542 details


Gene Information

Locus tag: Nwi_1542
Symbol:
ID: 3676370
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 1687660
End bp: 1688919
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 61%
IMG OID: 637713097
Product: Phage major capsid protein, HK97
Protein accession: YP_318155
Protein GI: 75675734
COG category: [R] General function prediction only
COG ID: [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
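
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1687660, 1688919)
print(length)                  # 1260
print(protein_length(length))  # 419
```

Under these assumptions the computed values agree with the Gene Length (1260 bp) and Protein Length (419 aa) fields listed for Nwi_1542.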


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATCG CAATGAAATC CCGCGCCCGC GGGCTCGTCG CCGCGCGCGC CGATGCCGGC 
AGCGCAACGG CTATCCTCAA CGAGCTGCGC CAGACATTCG AAACGTTCAA AGCCGAGCGC
GAGAAAGAAA TCGCTGACCT TAAGAAGGGT TTGGGCGATG TCGTGCAGTC AGAAAAGGTT
GATCGCATCA ACGCCGAGAT CACGAAGCTG CAGGAACAGC TGGACCAGGT GAACTCGTCG
ATCGCCGCTC TCAAGGTCGG CGGCGCTGGC GACGACAAGC CGCTGGCCGC CGAGCGTCGC
GAGCATGCGA GAGCCTTCAA CCAGTTCTTC CGCAATGGCG CGGAAAACGG CTTGCGGGAT
CTGGAAGTGA AAGCCGCGTT GCGCACCGAC AGTGATCCGG ATGGCGGCTT CGTCGTTCCG
GACCAGATGG AAGCGGCCAT TGACCGCGTG CTCGGCACAG TGTCGGCGAT GCGCGCGATC
TCGCGCGTCA TGTCGATTTC GTCCGGCACC TATAAGAAGT TGGTCAATCA GGGCGGCGCG
GTCGGCGGTT GGGTCGGCGA GCGGCAGGCT CGCCCAGCGA CCGCCACTCC AACGTTGGTG
GAATTGGCCT TCCAGGCGAT GGAACTCTAC GCTAACCCGG CGGCGACCCA GACGCTCCTC
GATGATTCCC GCGTCAATAT CGAGCAGTGG CTCGCCGATG AGGTATCGAT CACCTTCGCT
GAAATGGAAG GCGCGAGCTT CATCATCGGC GACGGCGTCG GCAAGCCGCG TGGGCTTCTT
TCCTACGACA CGGTAGCCGA TACCTCTTAT GCGTGGGGCA AGCTCGGCTA TGTCGTCTCC
GGAGTGGCGG CAGCGATGAC CGACTCGTCG CACAACGGCG CGGATGCGCT TACCGATCTG
GTTTACTCGA TCAAGCAGGG CTACCGGCAG AATGCGCGGT TCCTCATGAA TCGGAAGACG
CAGGCCGCGA TCCGCAAGTT CAAGTCGAAA ACCGAGGAAT TGTATCTGTG GCAGCCGTCG
ATTCAGGCCG GTCAGCCGGC GACGATCCTC GGATATCCGG TGACGGATGA CGATAACATG
CCGGACGCGA CCGCCGGCGG GAATTTCCCG ATCGCCTTCG GCGACTTCCA GCGCGGCTAC
CTGATCGTTG ACCGCATGGG CGTGCGCGTG CTGCGCGACC CGTTCACCAA CAAGCCTTAC
GTGCACTTCT ATACCACTTG TCCGTCGTCA GATAATTTGA GACTGATCAG ACATTTCTAG
 
Protein sequence
MTIAMKSRAR GLVAARADAG SATAILNELR QTFETFKAER EKEIADLKKG LGDVVQSEKV 
DRINAEITKL QEQLDQVNSS IAALKVGGAG DDKPLAAERR EHARAFNQFF RNGAENGLRD
LEVKAALRTD SDPDGGFVVP DQMEAAIDRV LGTVSAMRAI SRVMSISSGT YKKLVNQGGA
VGGWVGERQA RPATATPTLV ELAFQAMELY ANPAATQTLL DDSRVNIEQW LADEVSITFA
EMEGASFIIG DGVGKPRGLL SYDTVADTSY AWGKLGYVVS GVAAAMTDSS HNGADALTDL
VYSIKQGYRQ NARFLMNRKT QAAIRKFKSK TEELYLWQPS IQAGQPATIL GYPVTDDDNM
PDATAGGNFP IAFGDFQRGY LIVDRMGVRV LRDPFTNKPY VHFYTTCPSS DNLRLIRHF
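
The sequences above are printed in blocks of 10 characters, six blocks per line, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC percentage of the kind reported in the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the exact method used to produce the 61% figure in this record is not stated, so this is an assumption about how such a value could be derived.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence by dropping spaces and line breaks."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listed above (six blocks of 10 bases).
first_line = "ATGACCATCG CAATGAAATC CCGCGCCCGC GGGCTCGTCG CCGCGCGCGC CGATGCCGGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(seq[:3])   # ATG

# Toy input with a known answer: two of four bases are G or C.
print(gc_percent("ATGC"))  # 50.0
```

Passing the full 21-line gene sequence through `clean_sequence` should yield a string of 1260 bases if the listing is complete, which can then be handed to `gc_percent`.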