Gene Nham_1719 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_1719 
Symbol 
ID4030319 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp1921475 
End bp1922629 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content59% 
IMG OID637970191 
Productphage major capsid protein, HK97 
Protein accessionYP_576995 
Protein GI92117266 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTTTCC ACACTGCTAC TGCTCTTGAG ACCAAGAACG CAGCCGAACT TCCTGCGGAA 
GACGGCGCGG CCGAAATCAA GTCGGCGCTT GAAGCGCTGA CCGCTGACGT CAACACGAAG
ACGGCACCGG TCGCCGATCT TGAAAAGCGA CTTGCCGCGG CCGAAGTCAA ACTCGCCCGC
CCTGCCATTC ATACTGAGAA GAAGGACGAG ATCAGCGCTG AACGCAAAGC ATTCACCGGC
TACCTGCGCA ACGGCAAAGA GACGTTGACC GCGGATGAGG TGAAGTCGCT CACGATCGCC
CCAGATAGCT CTGGAGGTTA CCTCGCTCCG ATTGAGTTCA GCGCGGAAGT CGTGAAGGGT
ATTGTCGAGC AATCGCCCGT TCGCCAGGCT GCTCGCGTGG GCAACACTTC GAGCGGCGAA
GTTCTAATTC CTAAGCGCAC GGGTCGCCCG ACTGGCAAGT GGGTCGGTGA AACCGAGACC
CGCACCGGAA CGGAGTCCAG TTACGGTCAG GTTGAGATCC CGATCCATGA AATGGCGTGC
TATGTGGATG TTTCGCAGCG CCTCTTGGAA GACGCCGCGG TCAACGTTGA AGCCGAAGTT
GCCTTCGACC TCGCCGAGGA ATTCGGACGT CTGGAGGCTC TTGGTTTCCA GCGTGGTGAT
GGCGTGAAGA AACCGCTCGG CGTCATGGCG TCTGCGGGTA TCGCCTACAC GCCAACGGGC
AACGCTTCGA CGCTCGGCAC GAATCCTGCC GACACTATCA TCGATGCGTT CTACGCCTTG
CCGGCGTTCT ATCGTGGCCG GTCGGTTTGG ATGATGAACT CCAAAACCAT AGCCACCGTT
CGCAAGCTGA AGGATGGCAC CACGGGCAGT TACCTGTGGC AGCCCGGTTT GGCCGCGTCT
GATCCTTCGA CCATCCTCGG GCGTCCGGTT ATCGAAGATA ATACGCTCGA CGATGTTGGC
AGCGCAGCAG AGCCGATCGT GTTCGGCGAC TTCGCCTCTG CGTACCGCAT TTATGATCGC
GTCGCATTGA GCCTGCTTCG TGACCAGTAC AGCCAAGCTG CAAATGGTCT GGTGCGTTTC
CACGCGCGTC GTCGTGTTGG TGGTGCGTTG GTTCTCGCCG ATGCGCTGCG CAAGATCAAG
TGCGCGACCA GCTAA
 
Protein sequence
MTFHTATALE TKNAAELPAE DGAAEIKSAL EALTADVNTK TAPVADLEKR LAAAEVKLAR 
PAIHTEKKDE ISAERKAFTG YLRNGKETLT ADEVKSLTIA PDSSGGYLAP IEFSAEVVKG
IVEQSPVRQA ARVGNTSSGE VLIPKRTGRP TGKWVGETET RTGTESSYGQ VEIPIHEMAC
YVDVSQRLLE DAAVNVEAEV AFDLAEEFGR LEALGFQRGD GVKKPLGVMA SAGIAYTPTG
NASTLGTNPA DTIIDAFYAL PAFYRGRSVW MMNSKTIATV RKLKDGTTGS YLWQPGLAAS
DPSTILGRPV IEDNTLDDVG SAAEPIVFGD FASAYRIYDR VALSLLRDQY SQAANGLVRF
HARRRVGGAL VLADALRKIK CATS