Gene Nwi_1628 details


Gene Information

Locus tag: Nwi_1628
Symbol:
ID: 3675420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 1771955
End bp: 1773133
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 61%
IMG OID: 637713186
Product: Phage portal protein, HK97
Protein accession: YP_318241
Protein GI: 75675820
COG category: [S] Function unknown
COG ID: [COG4695] Phage-related protein
TIGRFAM ID: [TIGR01537] phage portal protein, HK97 family
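
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1771955, 1773133)
print(length)                           # 1179
print(expected_protein_length(length))  # 392
```

Under these assumptions the coordinates agree with the listed gene length (1179 bp) and protein length (392 aa).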


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.220321
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTCT GGCCCTTCCG CCGCACTACT ATTGAAACCA AATCGCTCGC CGATCCGGAT 
GAAGCGATCT TCGCAATTCT CTCGGGCGGC AGCACGTACG GGCCGGTTCA TCCTCTCTCA
GAACCCGCTG TTTCCGCAGC CGTCACGACG ATCAGCAATG CTGCGGCAAC GCTCGATTTA
CACCTGGTCA AACGCGACGA CACGTCCGCC GACGTAAAAC ACCCCGCACT CGACCTGTTG
CGCGGTCACG TCAACGAATG GACATCCGGC TCCGACCTGA TCCGAGACCT TATCACACAA
GCACTAACCG CCGACGCCGG CGGGATTGCC TGGGTCAACA AAAGCGGCGA CGGCCGACCC
CTTGAAATTA TCCGGTATAC CGCGGGCCGC ATCACCGTCG AGTATGCCGG CGACGGCTCC
GGTCGCCCAT CCTACACCGT CAACGGTCGC AGCGTGCCGG CGTCGGACGT CGTTCACATT
CGCGGCCCAT TCTCGAAATG CCCGGTCAGT CTTGCCTATC CGACAATCTC CGCCGCAGTG
GCGATGTCGC GTTATGTCGA ATTCCTCTTC AAGCGGATGG CGCGTCCCGG CGGCGTTGTC
GAAGTGCCGA CCGGCGCGGG CGAAAAGGCC GTCCAGAACA TGATCGCGGG CTGGAACGCG
GCTTACGGCG GCGCGGACAA CGCCGGCGGC ACCGCATTCC TGTTTGACGG TGCGACGTTC
AAGCAGATCG CACTTGCATC GACCGACGCT CAGTTCGTCG AAAACCGCCG ATGGCAACTG
GAAGACATCG CGCGCGCATT CAACATTTCG AGCGTCATGC TCGGTGACCT CACAAAATCG
AGCTACGCCA ATGCATGGCA AAAATACAGA GAGTTCCTGA GCGTCACGCT GATGCCGTGG
CTCAAGGCGC TTGAATCTGC GTTCGACCGC GCGCTGCTGA CTGATGACGA GCGGGCGCTG
TACGCGTTCA AGTTCGACAT TGATGACCTG ACACGCGTGG ACCTGGAGAA GCGTGCGACC
GCCATTTCCA GCCTCGTTGC CAGTCGCGTG CTCAATCCAA ACGAAGCCCG CACATGGCTG
GATACCGGCC TTGCGCCTTA TGCCGGCGGC AACGAATTTG CCAACCCGAA CACCGGCGCC
AGCCAGCCTG GATCGCAGGA GCAGCCCGTA AATGACTGA
 
Protein sequence
MKLWPFRRTT IETKSLADPD EAIFAILSGG STYGPVHPLS EPAVSAAVTT ISNAAATLDL 
HLVKRDDTSA DVKHPALDLL RGHVNEWTSG SDLIRDLITQ ALTADAGGIA WVNKSGDGRP
LEIIRYTAGR ITVEYAGDGS GRPSYTVNGR SVPASDVVHI RGPFSKCPVS LAYPTISAAV
AMSRYVEFLF KRMARPGGVV EVPTGAGEKA VQNMIAGWNA AYGGADNAGG TAFLFDGATF
KQIALASTDA QFVENRRWQL EDIARAFNIS SVMLGDLTKS SYANAWQKYR EFLSVTLMPW
LKALESAFDR ALLTDDERAL YAFKFDIDDL TRVDLEKRAT AISSLVASRV LNPNEARTWL
DTGLAPYAGG NEFANPNTGA SQPGSQEQPV ND
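
The relationship between the gene sequence and the protein sequence can be sketched in a few lines. This is a minimal example, not the pipeline that produced the record: it assumes the gene sequence is given 5' to 3' on the coding strand, and it uses the codon-to-amino-acid assignments of translation table 11, which match the standard code (the tables differ in permitted start codons; this gene begins with ATG, so that difference does not apply here). The helper names are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering (64 entries).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a coding-strand DNA string codon by codon.

    Raises ValueError on letters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        c1, c2, c3 = dna[i:i + 3]
        index = 16 * BASES.index(c1) + 4 * BASES.index(c2) + BASES.index(c3)
        residue = AMINO_ACIDS[index]
        if residue == "*" and to_stop:
            break  # stop codon: end of the protein
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and final nine bases of the gene sequence above.
print(translate("ATGAAACTCT GGCCCTTCCG CCGCACTACT"))  # MKLWPFRRTT
print(translate("AATGACTGA"))                         # ND
```

The two printed fragments match the first ten and last two residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` gives a way to compare against the record's protein sequence and GC content fields.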