Gene Information |
Locus tag | Saro_2121 |
Symbol | |
ID | 3918784 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2260265 |
End bp | 2261365 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640444874 |
Product | GTP-dependent nucleic acid-binding protein EngD |
Protein accession | YP_497394 |
Protein GI | 87200137 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0012] Predicted GTPase, probable translation factor |
TIGRFAM ID | [TIGR00092] GTP-binding protein YchF |
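
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is not part of the record. It assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2260265
end_bp = 2261365
gene_length_bp = 1101
protein_length_aa = 366

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as the record states) and ends in one
# stop codon that does not contribute a residue to the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # prints: 1101 0 366
```

Under these assumptions the span equals the listed gene length, it is a whole number of codons, and the codon count minus the stop codon equals the listed protein length.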
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.328757 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTTTC GTTGCGGGAT CGTCGGTCTT CCCAATGTCG GCAAGTCGAC GCTGTTCAAT GCGCTGACCG AAACGCAGGC GGCGCAGGCC GCCAACTATC CGTTCTGCAC GATCGAGCCC AACGTCGGCA ACGTCGGCGT CCCTGATCCC CGGCTCGACA AGCTGGCCGA GATCGCTGGC AGCCAGAAGA TCATCCCCAC CCAGCTCGGC TTCGTCGACA TCGCCGGCCT CGTGCGCGGG GCATCGAAGG GCGAAGGCCT CGGCAACCAG TTCCTCGGCA ACATCCGCGA AGTGGACGCC ATCGTCCACG TCCTGCGCTG TTTCGAGAAC GACGACATCC AGCACGTCGA CAACAAGGTC GATCCTATCT CCGACGCCGA GACGGTCGAG ACCGAACTGA TGCTGTCGGA CCTCGAAAGC CTCGAGAAGC GCGTTCCCGC CGCCGAAAAG AAGGCCAAGG CGGGCGACAA GGAATCGAAG ATCATCGCCT CGGTCCTCGG CCAGGCGCTC GAACTTCTGC GCGACGGCAA GCCCGCTCGC CTCACCCAGC CGAAGGATGA CGAGGAAGCG CGCGTCTTCA AGCAGGCCCA GCTCCTCACC GCCAAGCCCG TTCTCTACGT CTGCAACGTC GAGGAAGAAA GCGCGGCGAA CGGCAACGCC TTCTCCGCCC GCGTCTTCGA AAAGGCCAAG GCCGAAGGCG CCAACGCGGT GATCGTTTCG GCCGCGATCG AATCCGAACT CGTCGGCATG GACCCCGAGG AACGCTCCGT TTTCCTCGAG GAAATGGGCC TGCACGAAAC CGGCCTCGCC CGCGTGATCC GCGCCGGCTA CGAGCTGCTT CACCTCATCA CCTTCTTCAC CGTCGGCCCC AAGGAAGCGC GTGCATGGAC CGTGCACCTT GGCGCAAAGG CGCCCGAAGC CGCCGGTGAG ATCCACTCCG ACATGCAGCG CGGCTTCATC CGCGCCGAAA CCATCGCCTA CGACGATTTC GTCAGCCTCG GCGGCGAAAG CGCCGCGCGC GATGCCGGCA AGCTGCGCCA GGAAGGCAAG GAGTACGTGG TGAAGGACGG CGACGTCCTC CACTTCAAGT TCAACGTCTG A
|
Protein sequence | MGFRCGIVGL PNVGKSTLFN ALTETQAAQA ANYPFCTIEP NVGNVGVPDP RLDKLAEIAG SQKIIPTQLG FVDIAGLVRG ASKGEGLGNQ FLGNIREVDA IVHVLRCFEN DDIQHVDNKV DPISDAETVE TELMLSDLES LEKRVPAAEK KAKAGDKESK IIASVLGQAL ELLRDGKPAR LTQPKDDEEA RVFKQAQLLT AKPVLYVCNV EEESAANGNA FSARVFEKAK AEGANAVIVS AAIESELVGM DPEERSVFLE EMGLHETGLA RVIRAGYELL HLITFFTVGP KEARAWTVHL GAKAPEAAGE IHSDMQRGFI RAETIAYDDF VSLGGESAAR DAGKLRQEGK EYVVKDGDVL HFKFNV
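
The sequences above are printed in space-separated blocks of 10 residues. The following is a minimal sketch, not part of the record, of how such a block-formatted gene sequence might be cleaned, its GC fraction computed, and its translation compared with the listed protein. The helper names (`ungroup`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, which assigns the same amino acids to codons as translation table 11 (the two differ in permitted start codons). Only the first three blocks of the gene and the first block of the protein are used here to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same codon-to-amino-acid mapping and differs only in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def ungroup(seq: str) -> str:
    """Remove the whitespace that splits a sequence into 10-residue blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence and first block of the protein
# sequence, copied from the record.
gene_prefix = ungroup("ATGGGTTTTC GTTGCGGGAT CGTCGGTCTT")
protein_prefix = ungroup("MGFRCGIVGL")

print(len(gene_prefix), translate(gene_prefix) == protein_prefix)
```

Passing the full gene sequence string to the same functions would allow the listed GC content and the complete protein sequence to be checked in the same way.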