Gene Nham_1994 details

Gene Information

Locus tag: Nham_1994
Symbol:
ID: 4031547
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter hamburgensis X14
Kingdom: Bacteria
Replicon accession: NC_007964
Strand:
Start bp: 2210995
End bp: 2212767
Gene Length: 1773 bp
Protein Length: 590 aa
Translation table: 11
GC content: 59%
IMG OID: 637970451
Product: ATP-dependent OLD family endonuclease
Protein accession: YP_577253
Protein GI: 92117524
COG category: [L] Replication, recombination and repair
COG ID: [COG3593] Predicted ATP-dependent endonuclease of the OLD family
TIGRFAM ID:
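
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. Neither assumption is stated in the record, and the function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 2210995
END_BP = 2212767
GENE_LENGTH_BP = 1773
PROTEIN_LENGTH_AA = 590


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Amino-acid count, assuming the gene length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the listed values: 2212767 - 2210995 + 1 = 1773 bp, and 1773 / 3 - 1 = 590 aa.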


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCGATCC ATCCCAACAT AAGACGTCTC GATATAAGTC GCTTCAGGTG TATCGAGCAT 
CTCGAGTGGC GGCCCGACCA GGGCGTCAAT GTCTTGGTCG GTGGTGGCGA CTCGGGCAAG
AGCACCGTGC TCCATGCGAT AGCGTTGCTC TTCAGCCCTA CGAATATGGT CCAAGTGTTC
GAGACCGACT ATTTCAATCG TAGCAGTGAG CAGGGCTTCT CGATCGAAGC GGTCGTCGAG
CTTCCAGAAG AAGTCGGCAT AGGCAATCTT CAACAAACAT TGTGGCCTTG GGCGTGGAAC
AAGCAGGCCG CTGTTCTCCC CAATCCCGAT GTGTCCGGCG GACCGGAAAC GCCAGTTTAT
CGCTTTCGCG CACGAGGGAC CAACGAACTC GAGCTCACCT GGGAAGTCAT TCAGCCGAAC
GACGAGGTCG TCGGATTGCC TGTCGGATTG CGTCGCAAGA TCGGCGTCGT CCGCCTGGCC
AACGATGATC GCAATGATCG CGATCTCCGC CTAGTCGCAG GCTCCGCGCT CGATCGGTTG
CTCTCGAGCG GCAATCTTAA ATCCCGGATC AATAAGCAGA TCGCGGAGAC GGACCTAGCC
ACCGCGCTTC TCGATAGCGA GATTGAGGCC CTCCGAACGT TGGGTACAAC GCTGGAAAAG
GCGGGCCTTC CGCACGACCT CGCACTTGGC CTCACAAGCA GTCAGGGACT GTCGATCGGA
GCACTCATCG GCCTGCTGGC CAGCCGCGAC GGCGTCATGC TTCCGCTGGC CAGTTGGGGT
GCCGGCACGC GTCGTATGTC GTCGTTGGAG ATCGCCGCAA GCACTGAATC TGCCACCAGG
TTGACGATCA TCGATGAGAT CGAGAGAGGG CTTGAGCCCT ATCGACTCCG GCAACTCATC
GCCAAACTGG ACGACGGGGG CGGGCAGTGT TTCGTTACCA CGCACAGCCC GGTCGCGGTC
GCGGCATCAG GAAACGGACA ATCCGCATTG TGGTTCGTGG ATGCGAAGGC TCAGATCGGA
TCTCTGCCGC GGGAAGCCAT TGAGCGTCAG CAGAAGCGCG ATCCGGAGAC GTTCCTGGCG
AAGCTTCCAA TCATCGCCGA GGGGGTCACT GAAGTCGGAT TTCTACGCAG GGTCCTTCGG
AATGCGTTCA CTGCGCCGCC TGCATTTTTG GGCCTCAGGG TCTGCGACGG TGGCGGGAAT
GATGCCATGC TTGGATTGTT GGAGGCGCTG CGGAGCGCCG GGTTATCAAT TGGAGCATTC
TGTGACGACG AAGGTAGGTT CTCCGGTCGG TGGAAGGCAG TCTCTGAAGC CTTAGGTCCG
CGTTTCTTCC AATGGTCCAA GGGGTGTTTG GAGGAGAACG TTATCAAACA TGTCATGGAT
AAAGATCTAA TGGCGCTCGC GCTCGACCCC GAAGGGGCGT CCGGCCGGAG GTTACGCACG
ATTGCGGTTC GATTGAACCT GTCGGTCAAG GACGAAACGA GCATTCTGAA CGCCTGCGGA
ACTCCCGAAT ATCGGTATGC AAAACTGCGA TCGATCATCA TCGCCGCAGC GACTGGCGAC
GATGAAGGAG CGCCGAACGA CGAGGCAAAG AAGGAATGGA GGAGGCACAG CCAAGAATGG
TTTAAATCCC TGCAAGGCGG AGCCGAACTG GCGGTCAAGG CGATGGATTT GAAAGTTTGG
CCGAAGATCG AGGCGGATCT CTTGCCGTTT ATCAATGCGC TGGGCTCCGC ATTTGGTCAG
GCACCTCTAT CTCCGGGGGC GCTGAAGCCG TGA
 
Protein sequence
MAIHPNIRRL DISRFRCIEH LEWRPDQGVN VLVGGGDSGK STVLHAIALL FSPTNMVQVF 
ETDYFNRSSE QGFSIEAVVE LPEEVGIGNL QQTLWPWAWN KQAAVLPNPD VSGGPETPVY
RFRARGTNEL ELTWEVIQPN DEVVGLPVGL RRKIGVVRLA NDDRNDRDLR LVAGSALDRL
LSSGNLKSRI NKQIAETDLA TALLDSEIEA LRTLGTTLEK AGLPHDLALG LTSSQGLSIG
ALIGLLASRD GVMLPLASWG AGTRRMSSLE IAASTESATR LTIIDEIERG LEPYRLRQLI
AKLDDGGGQC FVTTHSPVAV AASGNGQSAL WFVDAKAQIG SLPREAIERQ QKRDPETFLA
KLPIIAEGVT EVGFLRRVLR NAFTAPPAFL GLRVCDGGGN DAMLGLLEAL RSAGLSIGAF
CDDEGRFSGR WKAVSEALGP RFFQWSKGCL EENVIKHVMD KDLMALALDP EGASGRRLRT
IAVRLNLSVK DETSILNACG TPEYRYAKLR SIIIAAATGD DEGAPNDEAK KEWRRHSQEW
FKSLQGGAEL AVKAMDLKVW PKIEADLLPF INALGSAFGQ APLSPGALKP
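
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not the database's own procedure. It builds the standard codon assignments, which translation table 11 shares, and treats an alternative initiator codon at the first position as methionine. The `ALT_STARTS` set is a deliberately small subset of the table 11 start codons, and all names here are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G,
# with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only a subset of the table 11 initiator codons is handled.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in ALT_STARTS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("GTGGCGATCC ATCCCAACAT AAGACGTCTC"))  # MAIHPNIRRL
```

The leading GTG codon would encode valine internally, but as the initiator it yields methionine, which is why the listed protein begins with M.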