Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2373 |
Symbol | engA |
ID | 3784964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2698751 |
End bp | 2700151 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637812462 |
Product | GTP-binding protein EngA |
Protein accession | YP_413054 |
Protein GI | 82703488 |
COG category | [R] General function prediction only |
COG ID | [COG1160] Predicted GTPases |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR03594] ribosome-associated GTPase EngA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.537703 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCCA CCCTCGTACT GGTAGGGCGA TCCAACGTCG GCAAGTCCAC GCTCTTTAAC CGTTTGACAC GCAGCCGCGA CGCGCTGGTG GCCGACCTGC CGGGGTTGAC GCGCGACCGT CATTACGGAC ACGGCAAACT GGGTGACAGG CCGTATCTCG TGGTCGATAC AGGAGGCTTC GAGCCGATGG CAACGGAAGG CATCCTGCAC GAAATGGCGA AGCAGACACT GCAAGCAATT GACGAGGCTG ATGTCGTGCT CTTTATCGTG GACGGTCGAA GCGGTTTGAC GGCGCAGGAC AAAATTGTCG CCGAGCAACT GCGCAGATCG GGTCGCCGAA CCTTGCTGGC GGTAAACAAG ACCGAAGGCA TGGCTGTTTC CGTCGTTACT GCGGAGTTTC ACGAACTGGG ATTGGGCGAG CCTTGCGCGA TTTCCGCCGC CCATGGCGAC AACGTGAATG AACTGGTGAC ACTGGCGCTT CAGGATTTTC CCGACGAACC TGAGCAGGAA AGAAAAGACG ACCATCCGAA AATCGCCATC GTGGGTCGTC CCAATGTAGG AAAATCAACG CTCGTGAACA CCCTGCTGGG AGAGGAGCGT GTCATCGCCT TTGATCAGCC GGGAACTACG CGTGACAGCA TTTATATCGA TTTTGAGCGG AATGGGCGCA CTTATACCCT GATCGATACA GCGGGCCTGC GTCGACGCGG CAAGGTGCAG GAGACCGTGG AGAAGTTTTC CGTGGTGAAA ACACTGCAAG CGATAGAAGA TGCCAACGTG GTGATACTGG TGCTGGATGC AGCCAGTGAA ATTTCAGATC AGGATGCGCA TATTGGCGGA TTCATCCTGG AAGCAGGACG GGCACTGGTG CTGGCCGTGA ACAAGTGGGA CAGCCTGGAT GAGTACCAGC GTGACATGAT CAAGCGCGAT ATCAACCGTA AATTGCCGTT TCTGCAGAAT TTCGCCCGGT TTCACTATAT TTCGGCGCTA CATGGCACTG GCACGAAAGG GTTGCTGCCC TCTGTCGATG CCGCCTATGG GGCGGCGATG GCTCATCTGC CTACTCCCAG GCTTACGCGC ACATTATTGG CCGCGGTGGA GAAGCAGCCT CCCCCGCGTG CCGGCATGTC GCGTCCCAAG CTGCGCTACG CCCATCAGGG CGGTTCGAAT CCGCCCCTGA TTATAATCCA TGGCAGCGCT CTCAATGCCG TGCCCCAGAC CTATCAGCGC TATCTGGAAA ATACATTTCG CGATACCTTC GGGCTGGAGG GAACGCCGCT CCGGATAGAA TTCAGGACAG GCCGCAATCC CTACGCAGGG AAAAGCCCCG CTCCGCTCAC CGAAGCCGAG GCAAAACGGG CTCATCGTCG TCGACGATAC GGGCGGAAGA AGTATGGGTA A
|
Protein sequence | MKPTLVLVGR SNVGKSTLFN RLTRSRDALV ADLPGLTRDR HYGHGKLGDR PYLVVDTGGF EPMATEGILH EMAKQTLQAI DEADVVLFIV DGRSGLTAQD KIVAEQLRRS GRRTLLAVNK TEGMAVSVVT AEFHELGLGE PCAISAAHGD NVNELVTLAL QDFPDEPEQE RKDDHPKIAI VGRPNVGKST LVNTLLGEER VIAFDQPGTT RDSIYIDFER NGRTYTLIDT AGLRRRGKVQ ETVEKFSVVK TLQAIEDANV VILVLDAASE ISDQDAHIGG FILEAGRALV LAVNKWDSLD EYQRDMIKRD INRKLPFLQN FARFHYISAL HGTGTKGLLP SVDAAYGAAM AHLPTPRLTR TLLAAVEKQP PPRAGMSRPK LRYAHQGGSN PPLIIIHGSA LNAVPQTYQR YLENTFRDTF GLEGTPLRIE FRTGRNPYAG KSPAPLTEAE AKRAHRRRRY GRKKYG
|
| |