Gene Nham_3337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_3337 
Symbol 
ID4032250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp3679901 
End bp3683440 
Gene Length3540 bp 
Protein Length1179 aa 
Translation table11 
GC content63% 
IMG OID637971747 
Productphage terminase GpA 
Protein accessionYP_578529 
Protein GI92118800 
COG category[R] General function prediction only 
COG ID[COG5525] Bacteriophage tail assembly protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.27063 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCCCGC CTCCGAGGAT GAAGCTGTCG GAATGGATTG AGCGTGAGCT TGTTCTGCCT 
TCCGATGTCG CTGCTCTAGC CGGGAAGGTT CGTCTCTACC CGTTTCAGCG CGAGATTGCT
GACGCCATCG GCGACCCCAC GATCGAGCGG GTCACGCTGG TCAAGCCGGT GCGCGTCGGA
TTCACGACGC TGCTCACCGG CGCCATGGCC GGGTTCTGCG CCAACGATCC GGCGCCGATC
CTTTCGCTGC TGCCAACGGA ATCGGACTGC CGCGATTATA TGGTTTCCGA CGTCGAGCCG
ATTTTCGCGG CGTCGCCGTC GATCAGCAAT CTGCTCTCGG GCGATCTGGA TGAGGCCGGC
CGCAACACGC TGGTGTCGCG GAGATTTCCC GGCGGCTCGT TAAAGGTGGT CGCGGCGAAA
GCGCCGCGCA ATCTGCGTCG GCACAACGTT CGGGTGCTTT TCATCGACGA AGCCGATGCG
ATGGAGCCGA CGGCGGACGG CTCGCCGGTC GTTCTCGCTG AAGACCGCAC GATCTCTTTC
CCGGATCGCA AGATCATCAT GGGGTCGACG CCGCGCTTCA TGGAGACGAG CTACGTTCTT
GCGGCTTACG CAAAATCGGA TCAGCGCATC TACGAGGTGC GATGTCCGGA GTGCGATGAG
TTTCACGAAA TCCAGTGGAA GGACATCCAC TGGCCGGAAA ACGAGCCGGA CAAAGCGGCG
TGGTGCTGTC CTGGCTGCGG TGTTGAGATC GGGGAAAAAC ACAAGGCCGC GATGGTTGCC
GCCGGGCGCT GGCGGGCATT GGCGCCGGAG GTTAAAGGCC ACGCCGGTTT CAAGCTGAAT
GCGCTGATCT CGCCGCTGTC GAATGTGGCA TGGGGTGCGC TGGCGGCGCG CTTCGTCCAG
GATAAAAATG ATCCGGCTAC CTTGCAGCCC TTCGTCAACA CCGTGCTGGC CGAGGGTTGG
CGTGAGGAGG GCGAGGAACT CGACGAAGCA GAACTCTCAA CCCGAGCGGA GCCGTTCAGT
CTGGTTGCCA ACGAAGAAGC CGGCACCACG GGCATCCCCG ATCTGGTGAT GGTCATCACC
GCCGGCGTCG ACGTGCAGCG CAAGGATCGA CTGGAGGTTA CCTTCATCGG CTGGGATGAG
GCGGGCAACG CTTACATCCT CGGCCACACC GTCATCTGGG GCATGTGGGA TGACGACACG
ACCTGGGCCG AACTCGATGC GGTTCTTGCA ACAAAATGGG ATCACCCGTT TGGCGGCAAG
ATCGGGATCG ACGCGGCCTG CGTCGACAGT TCGGACGGCG TGACCATGGA GACGGTCTAC
CGGTTTGCCT TCCCGCGCTT CCGCAGGAAG GTGCTGGCAA TCAAGGGTGT TCAAGGGACG
CGGCCGTGGA TCGAAAAATC GAAGTCGAAG ACGAAGGGCG GCTCGCTCTG GATCGTCGGC
ATCGACGGCA TCAAGGGTAC GATCTATTCG CGTCTGAGGC GCTCGAACAT GATCCGGTTC
TCGATGGACT TGCCGGTCGA TTGGTACGAG CAGCTCGCCT CGGAGCGTGT CGTCATCAAG
TACAACCGCG GCCAACCGAC ACGCCGTTTT GAACGCATCA CAGGCCGACT CGCCGAGGCG
CTCGACTGTA CGGTCTACGC CTTCGCCGCA AGGCAAGAGG TCGTATGGAC GACCGGCGCG
TCCGTTCGCC GCCGGGATTG GCGCACCGGC GTCTATTACA ACGAGATTCT GGATCTGCGG
CAGGGCGCCG TCCTGCTGGA TCGCCTCAAT GCCGGAGCGC CGCTGCTCGA CACACATGAC
GATTGGTCAC TGCGATCGGT GATCGGAAAC GTAGTGCCGG GCTCTGCGAA GATCGAAGGC
GGCAAGGGTC TTGCTCGGGT CCGGCTTTCG AATGCGCCAG GTGACGCCGA CACGGTGATC
AAGATTCGCG ATGGCAACAT TCGAAACATC AGTGTCGGCT ACGCGATCCA CAAAGTCGAG
AAGACCGATA ACGGCGAGGG CGCCGACGAA GATTGGCGCG TGGTCGATTG GGAGCCTTTG
GAAATATCCG CTGTTCCCAT ACCCGCCGAC CCCGGATCCG GCTTTCGCAG TGCCGACAAG
GCCGAGCAGT TCCCATGCGT TTTCGTTTCA ACCCGCTCGA ACAAGGAGGC TTCCATGCCC
GAGAGCACCA CTGTCGTAGC GGGAGATGAC CCCGCGACCA TCGAAACGCG TCAGCGGCCT
GATCCGGCTC CTGCTCCAGT CCAGCAGCGC CAGGCGCCGT CCCCGACGGA AGCGGCCGAT
GCCGCTGTGC GTGCCGAGCG CGAACGCGTT TCCACGATCA CCGATCTTGC GCGTCGCGCC
GCGGCGGTCG ACCTGGGCGA GCAGCACGTC CGCTCCGGCA CTGCCGTGGA TGCATTCCGC
ACTGCGCTGC TCGACCACAT GGTCAGCAGG GAAGCCGCAA CCCCGACCGA CAGCAATGTC
CGGGCGCACG TAGGGACCGA GGAATCGGAA ACGCGCCGCG GCCTGATGAT CGAGGCGCTG
GCCTATGGCC TCGGCGCTCC GCTTCCGCAG GCCGGCCCGA GCGAGGGCGC GCGCCAATAT
ATGGGGCGCG GCCTGGTCGA TCTCGCCGCC GACAGCGTGA ACTTCCGCGG TGGCCGCATG
CTCAATGCCC GCCAGATCGA CGATATCTTC ACCCGTGCAT CGCACACCAC GTCCGACTTC
CCGATTATCT TCGAGGGCGC CATCAACCGC ACCCTGGAAC AGCGCTTTGC CCTGGCACAG
CCGACGTTCC GGCGTTTTGC CCGCAAGCGG AATTTCCGTG ACTTCCGGCC GGACACGACC
GTGAAGATCG GCGACTTCCC GATGCTGCAG AAGGTGCTGC AAAGCGGCGA GATCAAGTAC
GGCAGCTTCG TCGAAGGCAA GGAGCAGGTG CAGGCATTCA GCTACGCCAT CGCTCTGCGC
GTTACCCGGC AAATGCTGAT CAACGACGAT CTCGGTGCAA TCTCCGAGTT GCTGTCGAGC
TATGGCTCGT CTGTCGCGCT GTTCGAAGAG GTCACATTCT ACTCCACCGC GTTCAACAGC
AAGCTGGCCG ATGGGAAGAC CGTGTTCCAC GCCGATCACG CCAACCTTGC TGCGGCGGGC
ACCGGGATCG ATGTCGACAA CGTCGGCAAG GCGCGAGCGG CCATGTCCAA GCAAAAGAGC
ACCGAAGGCA ACCCTCTGCT GGCGAATTCG CCGAAGATCC TGCTCGTTGG TCCCGACAAG
CTGACCGATG CCGAAAAGCT GCTGGCCTCG ATCACGCCGG CCACGGTCTC CAACGTCAAC
ATCTTCTCCG GCCGGCTGGA GCCGCTGGAA AGCTCGCAGC TGTCGGGCAA CGCCTGGCAT
CTGCTGACCG ACCCGGCGGC CGGCTCGAAT TATCGCTGGG GCTATCTGGA AGGCTACGAG
GCTCCTCGCG TTCGCATGGA CGAGCCGTTC GGCCAGCAGG GCTTTGCGAT GTCGGTCGAG
CACGACTTCG GCTGCGGCGC GACCGACTTC CGCTTCGGCT ACAAAAACCC CGGCGCGTAA
 
Protein sequence
MVPPPRMKLS EWIERELVLP SDVAALAGKV RLYPFQREIA DAIGDPTIER VTLVKPVRVG 
FTTLLTGAMA GFCANDPAPI LSLLPTESDC RDYMVSDVEP IFAASPSISN LLSGDLDEAG
RNTLVSRRFP GGSLKVVAAK APRNLRRHNV RVLFIDEADA MEPTADGSPV VLAEDRTISF
PDRKIIMGST PRFMETSYVL AAYAKSDQRI YEVRCPECDE FHEIQWKDIH WPENEPDKAA
WCCPGCGVEI GEKHKAAMVA AGRWRALAPE VKGHAGFKLN ALISPLSNVA WGALAARFVQ
DKNDPATLQP FVNTVLAEGW REEGEELDEA ELSTRAEPFS LVANEEAGTT GIPDLVMVIT
AGVDVQRKDR LEVTFIGWDE AGNAYILGHT VIWGMWDDDT TWAELDAVLA TKWDHPFGGK
IGIDAACVDS SDGVTMETVY RFAFPRFRRK VLAIKGVQGT RPWIEKSKSK TKGGSLWIVG
IDGIKGTIYS RLRRSNMIRF SMDLPVDWYE QLASERVVIK YNRGQPTRRF ERITGRLAEA
LDCTVYAFAA RQEVVWTTGA SVRRRDWRTG VYYNEILDLR QGAVLLDRLN AGAPLLDTHD
DWSLRSVIGN VVPGSAKIEG GKGLARVRLS NAPGDADTVI KIRDGNIRNI SVGYAIHKVE
KTDNGEGADE DWRVVDWEPL EISAVPIPAD PGSGFRSADK AEQFPCVFVS TRSNKEASMP
ESTTVVAGDD PATIETRQRP DPAPAPVQQR QAPSPTEAAD AAVRAERERV STITDLARRA
AAVDLGEQHV RSGTAVDAFR TALLDHMVSR EAATPTDSNV RAHVGTEESE TRRGLMIEAL
AYGLGAPLPQ AGPSEGARQY MGRGLVDLAA DSVNFRGGRM LNARQIDDIF TRASHTTSDF
PIIFEGAINR TLEQRFALAQ PTFRRFARKR NFRDFRPDTT VKIGDFPMLQ KVLQSGEIKY
GSFVEGKEQV QAFSYAIALR VTRQMLINDD LGAISELLSS YGSSVALFEE VTFYSTAFNS
KLADGKTVFH ADHANLAAAG TGIDVDNVGK ARAAMSKQKS TEGNPLLANS PKILLVGPDK
LTDAEKLLAS ITPATVSNVN IFSGRLEPLE SSQLSGNAWH LLTDPAAGSN YRWGYLEGYE
APRVRMDEPF GQQGFAMSVE HDFGCGATDF RFGYKNPGA