Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4028 |
Symbol | |
ID | 4596542 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 4252608 |
End bp | 4253459 |
Gene Length | 852 bp |
Protein Length | 283 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639778634 |
Product | ethanolamine utilization protein EutJ family protein |
Protein accession | YP_925212 |
Protein GI | 119718247 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4820] Ethanolamine utilization protein, possible chaperonin |
TIGRFAM ID | [TIGR02529] ethanolamine utilization protein EutJ family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.31103 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCAGA CCGCGCTGCG CGACACCATG GCCGCCCTCG AGGCGGCCAT GGTTAGCACC CCCCTCGACC GGCCACCCAC CCTGATCAAG GGGGGCGTGG ACCTCGGCAC GGCGTACCTG GTGATGGTCG CGCTGGACGC CGACGACCGT CCGCTCGCGG CGGCGTACGA GACCGCCGAC GTGGTCCGAG ACGGCGTCGT CACGGACTTC GTGGGCGCCA TCGACGTGCT CCGCCGGCTC AAGGCGCAGG TCGAGGACCG GCTCGGGGTG GCCGTCCCGG GCGCGCACGG CGCCTACCCG CCCGGAGTGG ACTCGGGCTC GGTCCGCGCG GTGCGCCACG TGATCGAGTC CGTGGGCATG GAGTGCACCG GCCTCGTCGA CGAGCCGAGC GCCGCCAACG CGGTGCTTCG CTTGCGCGAC GGCGTCGTGG TCGACATCGG CGGCGGCACG ACCGGGGTGG CCGTCGTGCA GGACGGCACG GTCGTGCACA CCGCCGACGA GCCGACCGGG GGGACGCACC TGAGCCTGGT CATCTCCGGC GCGCTCAAGG TGTCCTTCGA GGAGGCCGAG CGGCTGAAGA AGGACCCCGT GGAGCAGCCC CGGCTGTTCC CCGTGATCCG GCCGGTGATG GAGAAGGTGG CCTCGATCGT GTCGAGCAGC ACCCGTGGCT GGCCGACCCC GAAGGTCTAC CTCGTCGGCG GGACGGCCGC ATTTCCCGGG TTCGCCGACG TGGTCGCGGC GGCCACCGGG CTCGACGTCG TCGTGCCGGT GGCGCCCCTG TTCGTCACCC CGCTCGGCAT CGCGCGCAGC GCGCCCGCCC TCGAGGCGGG CGGCGCAGCG GGCTACCCAT GA
|
Protein sequence | MDQTALRDTM AALEAAMVST PLDRPPTLIK GGVDLGTAYL VMVALDADDR PLAAAYETAD VVRDGVVTDF VGAIDVLRRL KAQVEDRLGV AVPGAHGAYP PGVDSGSVRA VRHVIESVGM ECTGLVDEPS AANAVLRLRD GVVVDIGGGT TGVAVVQDGT VVHTADEPTG GTHLSLVISG ALKVSFEEAE RLKKDPVEQP RLFPVIRPVM EKVASIVSSS TRGWPTPKVY LVGGTAAFPG FADVVAAATG LDVVVPVAPL FVTPLGIARS APALEAGGAA GYP
|
| |