Gene Nwi_1625 details


Gene Information

Locus tag: Nwi_1625
Symbol:
ID: 3675363
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 1769731
End bp: 1770888
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 60%
IMG OID: 637713183
Product: Phage major capsid protein, HK97
Protein accession: YP_318238
Protein GI: 75675817
COG category: [R] General function prediction only
COG ID: [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
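
The start and end coordinates, the gene length, and the protein length listed above agree with one another. The minimal Python sketch below checks this. It assumes 1-based inclusive coordinates and that the final codon is a stop codon excluded from the protein length. The function and variable names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information fields above.
START_BP = 1769731
END_BP = 1770888


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1158 385
```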


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.490763
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.450118
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGGTAAGC ATTTCTCTGA TCTCGAATTC AAAGATACCG ACGACGCCGA TCCGGTCGAT 
CTGGTGACGA AGGCGGTCAA TGATCTGACT ACCAACGTTG ACGCGCGCTT GAAGGACATC
GAGACCAAAG CCGACACGAC CAAGCTCGTG GACCGCATGG ACAAGCTCGA AGCCAAGATT
AACCGGCCGG GCAGTGGCGA TCACAAGGAT CCGGACGCCG CGCTCGAAAT CAAGGCGTTC
GGCGCTTATG TCCGTTCCGG CATCACGCCG CCTGATCCGC TCGAACTCAA GACGCTGATC
GTTTCCAGCG ATCCGCAGGG CGGCTATCTG GCGCCGACCG AGATGGCGAC CGAGTTTATT
CGTGATCTGG TGCAGTACTC GCCGATCCGC GGTCTCGCTT CGGTCCGATC CACATCGGCG
CCTGCCGTGT CCTATCCCAA GCGCATCGGC CGCACGAATG CTCAATGGCG TGGTGAGACG
CAAGCGCAGA CGACTGGTGA GCCGACCTTT GGTCAGCTTG AAATCCCGGT TCGCGAGATC
AACACTTACG TCGATATCTC GAACCAGCTT CTCGCCGACA GCGCCGGCGC AGCCGAGTCC
GAAGTTCGGA TGGCACTCGC GGAGGACTTC GGCCTGAAAG AGGGACTGTC GTTCCTGAAG
GGCACCGGCC CGCTGCAGCC GGAAGGTTTG CTCATCAACG CGGACATCAG CATCGTCGCG
ACCGGCAACG CTTCCACGCT TGGCAGCGGA CCGGCCGACA TGCTCATCGA CACGTTCTAT
TCGCTTCCGG CTGCTTATCG GAACGCAGGC ACTTGGTTGA TGAATTCCAC GACGCTCGCG
GCCATCCGGA AGCTGAAGGA CGGCACGACC GGGACATACC TTTGGTCGCC CGGCTTCCAG
GGGCAGGCGG ACACCATCCT GGGTCGTCCA GTCATCGATT GCCCCGACAT GGATGATGTC
GGCAGCGGCA CCACGCCGAT CGCATTCGGC GACATCGCCG CGACCTATCG GATCCTCGAC
CGTATTGGTT TGTCGATTTT GGCCAATCCA TATTTGCTGG CGACCACCGG CACGACCAGA
ATTCATGCCA CGCGGCGTGT CGGCGGTGCG GTCGTTCAGC CAGCGGCGAT GAAGAAGATC
GTCTGCAAGA CCTCGTAA
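
The gene sequence is printed in space-separated blocks of 10 bases, so the blocks have to be joined before any computation. The sketch below shows one way to do that and to recompute a GC fraction for comparison with the GC content field. The helper names `clean_sequence` and `gc_content` are hypothetical, and the example input is only the first two blocks of the sequence, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, used here only as a small example.
example = "TTGGGTAAGC ATTTCTCTGA"
print(clean_sequence(example))  # TTGGGTAAGCATTTCTCTGA
print(gc_content(example))      # 0.4
```

Passing the full gene sequence to `gc_content` instead of `example` gives the value to compare against the reported 60%.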
 
Protein sequence
MGKHFSDLEF KDTDDADPVD LVTKAVNDLT TNVDARLKDI ETKADTTKLV DRMDKLEAKI 
NRPGSGDHKD PDAALEIKAF GAYVRSGITP PDPLELKTLI VSSDPQGGYL APTEMATEFI
RDLVQYSPIR GLASVRSTSA PAVSYPKRIG RTNAQWRGET QAQTTGEPTF GQLEIPVREI
NTYVDISNQL LADSAGAAES EVRMALAEDF GLKEGLSFLK GTGPLQPEGL LINADISIVA
TGNASTLGSG PADMLIDTFY SLPAAYRNAG TWLMNSTTLA AIRKLKDGTT GTYLWSPGFQ
GQADTILGRP VIDCPDMDDV GSGTTPIAFG DIAATYRILD RIGLSILANP YLLATTGTTR
IHATRRVGGA VVQPAAMKKI VCKTS
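
The gene sequence begins with TTG, while the protein sequence begins with M. This is consistent with translation table 11, in which TTG can act as an initiation codon and is then read as methionine. The sketch below illustrates the idea with a hand-built codon table. The `START_CODONS` set is an assumption covering only the most common bacterial start codons (table 11 lists others), and `translate` is a hypothetical helper, not a function from any library.

```python
BASES = "TCAG"
# Amino acids in the conventional order: first, second, and third base
# each cycle through T, C, A, G. "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(raw: str) -> str:
    """Translate a CDS, reading a recognized start codon as M and
    stopping at the first stop codon. Ambiguous bases raise KeyError."""
    dna = "".join(raw.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("TTGGGTAAGC ATTTCTCTGA TCTCGAATTC"))  # MGKHFSDLEF
```

The output matches the first ten residues of the protein sequence. Without the start-codon rule, the leading TTG would be read as L rather than M.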