Gene Nham_0337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_0337 
Symbol 
ID4030493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp366230 
End bp367378 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content60% 
IMG OID637968871 
Productphage major capsid protein, HK97 
Protein accessionYP_575693 
Protein GI92115964 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.139658 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCTTTC ATTTTCTTGA GACCAAATCT GCGGCTGACG TCGACGAAGG CGATCCGTCC 
ATCGTTGAAG TCAAGTCGGC GCTGACCGCT CTTACGGAAG ACGTAAAGAA GGCCACCGCA
CCGGTTGCCG ATCTGACCAA GCGCCTCGAC GAAATCGAGA CGAAGATCAA TCGTCCGGCC
ATTCACACTG AGAAAAAGGA CGAGATCAGC GACGAGCGCA AAGCGTTCAC CGGCTATCTT
CGCCGCGGCA AGGAAACGCT CCAGCCGGAC GAGATCAAGT CGCTGCGCGT TGCCGATGAT
ACCTCGGGCG GCTATCTGGC GCCTGCGGAG TTCAGCGCCG AAGTGGTCAA GGGCATCGTG
GAAATGTCGC CGATCCGTCA GGCGGCTCGC GTTGGCTCTA CGTCCAGCGG TGAAGTTCTG
CTGCCGAAAC GTACCGGCCG TCCGACCGGA TCGTGGGTTG GCGAAACCGA TGCGCGTCCG
GGCACGGAAT CGAGCTATGG TCAGATCGAA GTGCCGATCC ATGAAATGGC TTGCTACGTT
GACGTGTCAC AGCGCCTGCT TGAGGACGCG GCAGTCAATG TCGAGTCCGA AGTTGCTTCT
GACTTGTCCG AGGAATTCGG TCGGCTTGAA GGTCTCGGCT TCTCGCAGGG CGATGGCGTA
AAGAAGCCGA TTGGCATCAT GGAAGCGGCT GGCGTTGCCT ATACCGCGAC CGGCAATGCT
TCGACGCTTG GCACCGCGCC GGCCGACACC CTGATCGACG TTTTCTATTC GCTCCCGGCG
TACTATCGCA ATCGCGGCGT CTGGCTGATG AATTCGAAGA CGATCGCAGC GGTTCGCAAG
CTGAAGGACG GTTCTACCGG TGCCTACCTG TGGCAGCCTG GCCTTGCGCA GGGTGACCCG
GCGACGATCC TTGGCCGTCC GCTGATTGAA GATCCGACCA TGGATGACAT CGGCTCCGCT
GCCGAGCCTA TCCTGTTCGG TTCGGTTTCC GATGCCTATC GCATCTATGA CCGACTGAAT
CTTTCGATCA TGCGCGACCC GTACTCGCAG GCAACGTCTG GCGTTGTCCG CTTCCATGCG
CGTCGTCGCA CTGGCGGTGC GTTGGTTCTT GCCGATGCGC TCCGCAAGAT CAAGTGCGCG
ACCAGCTAA
 
Protein sequence
MTFHFLETKS AADVDEGDPS IVEVKSALTA LTEDVKKATA PVADLTKRLD EIETKINRPA 
IHTEKKDEIS DERKAFTGYL RRGKETLQPD EIKSLRVADD TSGGYLAPAE FSAEVVKGIV
EMSPIRQAAR VGSTSSGEVL LPKRTGRPTG SWVGETDARP GTESSYGQIE VPIHEMACYV
DVSQRLLEDA AVNVESEVAS DLSEEFGRLE GLGFSQGDGV KKPIGIMEAA GVAYTATGNA
STLGTAPADT LIDVFYSLPA YYRNRGVWLM NSKTIAAVRK LKDGSTGAYL WQPGLAQGDP
ATILGRPLIE DPTMDDIGSA AEPILFGSVS DAYRIYDRLN LSIMRDPYSQ ATSGVVRFHA
RRRTGGALVL ADALRKIKCA TS