Gene Nwi_1161 details


Gene Information

Locus tag: Nwi_1161
Symbol:
ID: 3675796
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 1272467
End bp: 1273642
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 65%
IMG OID: 637712711
Product: Phage portal protein, HK97
Protein accession: YP_317775
Protein GI: 75675354
COG category: [S] Function unknown
COG ID: [COG4695] Phage-related protein
TIGRFAM ID: [TIGR01537] phage portal protein, HK97 family
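
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon. The function name `cds_lengths` is hypothetical and is not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # the stop codon encodes no amino acid
    return gene_len, protein_len


if __name__ == "__main__":
    print(cds_lengths(1272467, 1273642))  # (1176, 391)
```

Under those assumptions the result matches the Gene Length (1176 bp) and Protein Length (391 aa) fields of this record.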


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.255409
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTTGC GCTTCAAAAA CATGTTTGCT CCGCCCGAGA CAAAAGCCAG TCGCACGGCC 
AGGCTGCTCG CCTTCGAGAG CGGCGGCCGC GCGCGCTGGA CGCCACGCGA CTACGCGGCG
CTGGCCCGCG AAGGCTATCT GGCCAATCCC GTCGTACATC GCGCGGTGCG GCTGATCGCG
GAGAATGTGG CGTCTTGCAG CTACCTCGTG TTCGAGGGCG CGCAGGAACG CGAGGCGCAT
CCGCTATCGT TGCTGCTCAC GCGTCCGAAT ACGCGGCAGG ACGGCGGCGC GTTTCTCGAA
ACGCTGGTGT CGCATCTGTT GCTGGCGGGG AACGCCTATG TCGAAACAGT CGCGCTCGAC
GGGGCGGTTC GCGAACTTCA CGCGCTGCGT CCCGATCGCA TGAAGGTGGT GCCGGGTCCC
GAGGGTTGGG CGGAAGCGTA TGAGTACAGC GTCGGCGGGC GCAGCGTGCG CTTTGATCAG
GCCTCATCCG TCGTGCCACC GATCCTGCAC CTGACGTTCT TTCATCCGCT CGACGATCAC
TACGGTCTTG CGCCGATCGA ATCCGCCGCC GTCGCCATCG ACACCCATAA CGCCGGATCG
AAATGGAATA AGGCGCTGCT CGACAACGCC GCGCGGCCGT CCGGCGCGCT GGTCTATGCC
GGGCCGGAGG GCGCGGTGCT CTCGGATTCA CAGTTCGACC GGCTCAAGCG CGAGTTGACC
GATACCTATC AGGGCGCGGT GAACGCGGGC CGGCCGCTGC TGCTCGAAGG CGGACTCGAT
TGGAAAGCGA TGTCGCTGAC GCCGAAGGAC ATGGATTTCC TGGAGGCCAA GCACACGGCC
GCGCGCGAGA TCGCGCTCGC TTTCGGCGTG CCGCCGATGA TGCTCGGTAT TCCCGGCGAC
AACACCTACG CCAATTTCCT GGAAGCCAAT CGCTGCTTCT TTCGCCAGAC CGTGTTGCCG
CTGGCGTCGC GTATCGGCAA TTCGTTCGCG CAGTGGCTGT CGCCGCAGTT CGGCGAAAGC
ATCCGCGTCG TCGTCGACAC CGACAAAATG GACGCGCTCG CCGCCGATCG TACGGCGTTG
TGGGAACGGG TCAGCGATGC GGCCTTTCTC ACGCTCAACG AAAAGCGCGA GGCGGTCGGC
TATGCGCCGA TCGAGGGCGG CGACCGCTTG GAGTGA
 
Protein sequence
MRLRFKNMFA PPETKASRTA RLLAFESGGR ARWTPRDYAA LAREGYLANP VVHRAVRLIA 
ENVASCSYLV FEGAQEREAH PLSLLLTRPN TRQDGGAFLE TLVSHLLLAG NAYVETVALD
GAVRELHALR PDRMKVVPGP EGWAEAYEYS VGGRSVRFDQ ASSVVPPILH LTFFHPLDDH
YGLAPIESAA VAIDTHNAGS KWNKALLDNA ARPSGALVYA GPEGAVLSDS QFDRLKRELT
DTYQGAVNAG RPLLLEGGLD WKAMSLTPKD MDFLEAKHTA AREIALAFGV PPMMLGIPGD
NTYANFLEAN RCFFRQTVLP LASRIGNSFA QWLSPQFGES IRVVVDTDKM DALAADRTAL
WERVSDAAFL TLNEKREAVG YAPIEGGDRL E
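
The gene and protein sequences above are related by codon translation. The following is a minimal sketch of how one might check that relationship and compute GC content. It assumes the space-separated 10-base blocks are purely cosmetic, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# shared by translation table 11); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Strip whitespace (block spacing, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_line = "ATGCGCTTGC GCTTCAAAAA CATGTTTGCT"
    print(translate(first_line))  # MRLRFKNMFA
```

Applied to the first 30 bases of the gene sequence, `translate` yields `MRLRFKNMFA`, which is the first block of the protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.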