Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nham_1037 |
Symbol | |
ID | 4031643 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter hamburgensis X14 |
Kingdom | Bacteria |
Replicon accession | NC_007964 |
Strand | + |
Start bp | 1157541 |
End bp | 1159877 |
Gene Length | 2337 bp |
Protein Length | 778 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637969535 |
Product | RNA binding S1 |
Protein accession | YP_576345 |
Protein GI | 92116616 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.255171 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCAAACA TCAATCGACG GATCGCGGAA GAACTCGGCG TTCGCGAGCA GCAAGTCGAA GCGACGGTAG CGTTGCTCGA CGGTGGCGCA ACTGTTCCTT TCGTCGCCCG TTACCGCAAG GAAATCACGG GTTCGCTGGA TGACGCTCAA TTGCGGACGC TGGAAGAACG TTTGAATTAT TTGCGCGAAC TTGAAGACCG TCGTGTCACG ATTCTCAATT CTGTTCGCGA ACAGGGCAAG CTCGACGCCG CACTGGAAGC GGCGATCCTC GCTGCCGACA GCAAGGGGCG TCTGGAGGAC ATTTACCTTC CGTTCAAGCC GAAGCGCCGT ACCAAAGCTG AGATCGCCAA AGAGCGCGGG CTGGAGCCGT TGGCTCACTT GCTGCTGGCC GAACCGCAGA ATGATCCAAA GACGGTAGCA GAGCCGTTCG TCAACGCCGA AAAGTCGATC GCCGATGTCG CGGCTGCGCT CGACGGCGCC CGCGCTATCC TGGTGGAACA TTTCGCGGAA GATGCCGACC TGATTGGCGC GCTGCGAGAA CAGATGTGGT CGAACGGTCT GATGGCGTCC ACCGTGCGTT CCGGCAAGAA GACCGAGGGC GAAAAGTTCA AGGACTATTT CGATTTTAGC GAGCCGCTCA CCAAGTTGCC GTCGCATCGC ATCCTCGCAA TGTTTCGCGG TGAGAAAGAG GATATCCTCG ACCTTCAAAT GTTGCCCGAA CCAGTTTCCG CTACGCCGGC CCCGGTCAGC CCCTGCGAAT TGAAGATCAT GCGGCGTTTC GCGATTTCCG ATCGTGGCCG GGCCGGCGAC AAGTGGCTAA TAGAAACGGT GCGCTGGGCC TGGCGGACCA AGATCCAGGT TCATCTCAAT GTTGATTTGC GGATGCGACT GTGGAACGCG GCCGAGCAGG AGGCGGTACG CGTGTTCGCG TCGAATTTGC GCGACCTCCT GCTGGCGGCG CCGGCGGGCG CGCGCGTAAC CATGGGTCTC GACCCCGGAT TCCGCACCGG CGTCAAGGTC GCTGTTGTCG ATGCGACCGG CAAGGTGGTG GCGACCACGG CCGTCTATCC GCACGAGCCT CAGCGGCAAT GGGATGCCAC GCTCGCCACG CTCGGCAAGC TCGCCGTTCA ACATCGCGTC GACTTGATCG CGATCGGCAA CGGCACGGCG TCGCGCGAGA CCGACAGGCT CGCTATGGAT CTGGTAAAAC TCCTGCCCGA TATGAAGATG TCGAAGATCG TGGTGTCGGA GGCCGGCGCT TCGGTTTATT CGGCGTCGGC CTTTGCGTCC GAGGAACTGC CCGAACTCGA CGTCACGCTG CGCGGCGCGG TATCGATTGC TCGACGACTG CAAGATCCAT TGGCCGAACT GGTCAAGATC GATCCGAAGG CGATTGGGGT GGGACAATAT CAGCATGACC TTGGCGAGAG TAAGCTGGCT CGCTCCCTCG ATGCGGTGGT TGAGGATTGT GTTAACGCGG TCGGAGTGGA TGCCAACACG GCGTCTGCGC CGCTTCTGGC ACGCGTATCA GGGATCGGCG CGGGGTTGGC GCAAACCATT GTCCAACACC GTGACGACAA TGGGCCGTTC AAGACACGGA AGGGGCTGAA GCAGGTGCCC CGGCTGGGGC CGAAAGCGTT CGAGCAATGC GCAGGTTTCT TGCGGATCAC CGGCGGCGAC GACCCGCTCG ATGCCTCGGG AGTCCATCCG GAGGCTTACC CCGTCGTGCG CAAAATTCTT GCTGCAACCA GGAGCGACAT CAAGGCGCTG ATCGGAAACG TGGATATCCT GCGTCAGGTC AAGCCGCAGA ACTTCGTTGA CGACACCTTC GGTTTGCCGA CCGTGGTCGA CATTCTGCGC GAGCTGGAAA AACCCGGTCG CGATCCGCGC CCGGCCTTCA AGGCTGCGGT GTTCAAAGAG GGAATCGAGA CACTCAAGGA TGTTAAACCC GGTATGATCC TCGAAGGTGC CGTAACCAAC GTGGCGGCGT TCGGCGCCTT CGTCGATATC GGTGTCCATC AAGATGGCCT GATCCACATT TCGGCGATGT CGAAGTCCTT TGTCAAAGAC CCGCGCGCGG TCGTAAAGTC GGGTGACGTC GTCAAGGTGA AGGTATTGGA GGTGGATGTC GCCCGTAAGC GGATCGCATT GACGCTCCGT CTCGATGACG AACTCGGCGG CAAGAGCGAG CGTGCGGCGC AAGGCCTCCC ACGAGACAAA ACCCGCCCGA TGACATCGCC GCCGGCCAGG CCGCATCAGC AGTCCGGCGG CGGAGCGCTT GCTGAAGCCC TGCGGCGCGC GACCGAGAAG AGCGGAAGCG GCAAGACTAA GTCATAG
|
Protein sequence | MANINRRIAE ELGVREQQVE ATVALLDGGA TVPFVARYRK EITGSLDDAQ LRTLEERLNY LRELEDRRVT ILNSVREQGK LDAALEAAIL AADSKGRLED IYLPFKPKRR TKAEIAKERG LEPLAHLLLA EPQNDPKTVA EPFVNAEKSI ADVAAALDGA RAILVEHFAE DADLIGALRE QMWSNGLMAS TVRSGKKTEG EKFKDYFDFS EPLTKLPSHR ILAMFRGEKE DILDLQMLPE PVSATPAPVS PCELKIMRRF AISDRGRAGD KWLIETVRWA WRTKIQVHLN VDLRMRLWNA AEQEAVRVFA SNLRDLLLAA PAGARVTMGL DPGFRTGVKV AVVDATGKVV ATTAVYPHEP QRQWDATLAT LGKLAVQHRV DLIAIGNGTA SRETDRLAMD LVKLLPDMKM SKIVVSEAGA SVYSASAFAS EELPELDVTL RGAVSIARRL QDPLAELVKI DPKAIGVGQY QHDLGESKLA RSLDAVVEDC VNAVGVDANT ASAPLLARVS GIGAGLAQTI VQHRDDNGPF KTRKGLKQVP RLGPKAFEQC AGFLRITGGD DPLDASGVHP EAYPVVRKIL AATRSDIKAL IGNVDILRQV KPQNFVDDTF GLPTVVDILR ELEKPGRDPR PAFKAAVFKE GIETLKDVKP GMILEGAVTN VAAFGAFVDI GVHQDGLIHI SAMSKSFVKD PRAVVKSGDV VKVKVLEVDV ARKRIALTLR LDDELGGKSE RAAQGLPRDK TRPMTSPPAR PHQQSGGGAL AEALRRATEK SGSGKTKS
|
| |