Gene Nham_1037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_1037 
Symbol 
ID4031643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp1157541 
End bp1159877 
Gene Length2337 bp 
Protein Length778 aa 
Translation table11 
GC content61% 
IMG OID637969535 
ProductRNA binding S1 
Protein accessionYP_576345 
Protein GI92116616 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.255171 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCAAACA TCAATCGACG GATCGCGGAA GAACTCGGCG TTCGCGAGCA GCAAGTCGAA 
GCGACGGTAG CGTTGCTCGA CGGTGGCGCA ACTGTTCCTT TCGTCGCCCG TTACCGCAAG
GAAATCACGG GTTCGCTGGA TGACGCTCAA TTGCGGACGC TGGAAGAACG TTTGAATTAT
TTGCGCGAAC TTGAAGACCG TCGTGTCACG ATTCTCAATT CTGTTCGCGA ACAGGGCAAG
CTCGACGCCG CACTGGAAGC GGCGATCCTC GCTGCCGACA GCAAGGGGCG TCTGGAGGAC
ATTTACCTTC CGTTCAAGCC GAAGCGCCGT ACCAAAGCTG AGATCGCCAA AGAGCGCGGG
CTGGAGCCGT TGGCTCACTT GCTGCTGGCC GAACCGCAGA ATGATCCAAA GACGGTAGCA
GAGCCGTTCG TCAACGCCGA AAAGTCGATC GCCGATGTCG CGGCTGCGCT CGACGGCGCC
CGCGCTATCC TGGTGGAACA TTTCGCGGAA GATGCCGACC TGATTGGCGC GCTGCGAGAA
CAGATGTGGT CGAACGGTCT GATGGCGTCC ACCGTGCGTT CCGGCAAGAA GACCGAGGGC
GAAAAGTTCA AGGACTATTT CGATTTTAGC GAGCCGCTCA CCAAGTTGCC GTCGCATCGC
ATCCTCGCAA TGTTTCGCGG TGAGAAAGAG GATATCCTCG ACCTTCAAAT GTTGCCCGAA
CCAGTTTCCG CTACGCCGGC CCCGGTCAGC CCCTGCGAAT TGAAGATCAT GCGGCGTTTC
GCGATTTCCG ATCGTGGCCG GGCCGGCGAC AAGTGGCTAA TAGAAACGGT GCGCTGGGCC
TGGCGGACCA AGATCCAGGT TCATCTCAAT GTTGATTTGC GGATGCGACT GTGGAACGCG
GCCGAGCAGG AGGCGGTACG CGTGTTCGCG TCGAATTTGC GCGACCTCCT GCTGGCGGCG
CCGGCGGGCG CGCGCGTAAC CATGGGTCTC GACCCCGGAT TCCGCACCGG CGTCAAGGTC
GCTGTTGTCG ATGCGACCGG CAAGGTGGTG GCGACCACGG CCGTCTATCC GCACGAGCCT
CAGCGGCAAT GGGATGCCAC GCTCGCCACG CTCGGCAAGC TCGCCGTTCA ACATCGCGTC
GACTTGATCG CGATCGGCAA CGGCACGGCG TCGCGCGAGA CCGACAGGCT CGCTATGGAT
CTGGTAAAAC TCCTGCCCGA TATGAAGATG TCGAAGATCG TGGTGTCGGA GGCCGGCGCT
TCGGTTTATT CGGCGTCGGC CTTTGCGTCC GAGGAACTGC CCGAACTCGA CGTCACGCTG
CGCGGCGCGG TATCGATTGC TCGACGACTG CAAGATCCAT TGGCCGAACT GGTCAAGATC
GATCCGAAGG CGATTGGGGT GGGACAATAT CAGCATGACC TTGGCGAGAG TAAGCTGGCT
CGCTCCCTCG ATGCGGTGGT TGAGGATTGT GTTAACGCGG TCGGAGTGGA TGCCAACACG
GCGTCTGCGC CGCTTCTGGC ACGCGTATCA GGGATCGGCG CGGGGTTGGC GCAAACCATT
GTCCAACACC GTGACGACAA TGGGCCGTTC AAGACACGGA AGGGGCTGAA GCAGGTGCCC
CGGCTGGGGC CGAAAGCGTT CGAGCAATGC GCAGGTTTCT TGCGGATCAC CGGCGGCGAC
GACCCGCTCG ATGCCTCGGG AGTCCATCCG GAGGCTTACC CCGTCGTGCG CAAAATTCTT
GCTGCAACCA GGAGCGACAT CAAGGCGCTG ATCGGAAACG TGGATATCCT GCGTCAGGTC
AAGCCGCAGA ACTTCGTTGA CGACACCTTC GGTTTGCCGA CCGTGGTCGA CATTCTGCGC
GAGCTGGAAA AACCCGGTCG CGATCCGCGC CCGGCCTTCA AGGCTGCGGT GTTCAAAGAG
GGAATCGAGA CACTCAAGGA TGTTAAACCC GGTATGATCC TCGAAGGTGC CGTAACCAAC
GTGGCGGCGT TCGGCGCCTT CGTCGATATC GGTGTCCATC AAGATGGCCT GATCCACATT
TCGGCGATGT CGAAGTCCTT TGTCAAAGAC CCGCGCGCGG TCGTAAAGTC GGGTGACGTC
GTCAAGGTGA AGGTATTGGA GGTGGATGTC GCCCGTAAGC GGATCGCATT GACGCTCCGT
CTCGATGACG AACTCGGCGG CAAGAGCGAG CGTGCGGCGC AAGGCCTCCC ACGAGACAAA
ACCCGCCCGA TGACATCGCC GCCGGCCAGG CCGCATCAGC AGTCCGGCGG CGGAGCGCTT
GCTGAAGCCC TGCGGCGCGC GACCGAGAAG AGCGGAAGCG GCAAGACTAA GTCATAG
 
Protein sequence
MANINRRIAE ELGVREQQVE ATVALLDGGA TVPFVARYRK EITGSLDDAQ LRTLEERLNY 
LRELEDRRVT ILNSVREQGK LDAALEAAIL AADSKGRLED IYLPFKPKRR TKAEIAKERG
LEPLAHLLLA EPQNDPKTVA EPFVNAEKSI ADVAAALDGA RAILVEHFAE DADLIGALRE
QMWSNGLMAS TVRSGKKTEG EKFKDYFDFS EPLTKLPSHR ILAMFRGEKE DILDLQMLPE
PVSATPAPVS PCELKIMRRF AISDRGRAGD KWLIETVRWA WRTKIQVHLN VDLRMRLWNA
AEQEAVRVFA SNLRDLLLAA PAGARVTMGL DPGFRTGVKV AVVDATGKVV ATTAVYPHEP
QRQWDATLAT LGKLAVQHRV DLIAIGNGTA SRETDRLAMD LVKLLPDMKM SKIVVSEAGA
SVYSASAFAS EELPELDVTL RGAVSIARRL QDPLAELVKI DPKAIGVGQY QHDLGESKLA
RSLDAVVEDC VNAVGVDANT ASAPLLARVS GIGAGLAQTI VQHRDDNGPF KTRKGLKQVP
RLGPKAFEQC AGFLRITGGD DPLDASGVHP EAYPVVRKIL AATRSDIKAL IGNVDILRQV
KPQNFVDDTF GLPTVVDILR ELEKPGRDPR PAFKAAVFKE GIETLKDVKP GMILEGAVTN
VAAFGAFVDI GVHQDGLIHI SAMSKSFVKD PRAVVKSGDV VKVKVLEVDV ARKRIALTLR
LDDELGGKSE RAAQGLPRDK TRPMTSPPAR PHQQSGGGAL AEALRRATEK SGSGKTKS