Gene Information |
Locus tag | Noca_4085 |
Symbol | |
ID | 4596599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 4314149 |
End bp | 4315579 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639778691 |
Product | MmgE/PrpD family protein |
Protein accession | YP_925269 |
Protein GI | 119718304 |
COG category | [R] General function prediction only |
COG ID | [COG2079] Uncharacterized protein involved in propionate catabolism |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCTGAGC CCGCGAGCCA GACCCTCGGC GAGCGGATCG CCGAGTTCGC GACCGACGCC GCCGCGAACG GCGTACCGAA CGACGTCGCC GCCAGCGTCC AGCAGCGGAC ACTTGACGTC CTCGGCCTGT GCGTCGCGGC CCACCGGCTG GATACCAGCG CCGCGATCAT CGAGCACGTG CTCGACCAGG GCGGCCACGA GCAGGCCAGC CTCGTCGGTC GCCCCGAGCG GGTGACCGCG GCGCAGGCGG CGCTCGTCAA CGGGGTGCTC GCGCACTCGC TCGACTACGA CGACACCCAC CTGCCGTCGA TCCTGCACCC GAGCGCCAGC GTGGTGCCCG CCGCCCTGGC CGCGGCCGAG CACGCCGGAG CGAGCGGGGA GCTCACGGTC CGCGCGATCG CGGTCGGGCT GGAGGTCGCC GTACGCCTCG GCATGGCGGG GTACGACGAG AAGCTCGGGA ACTCGGTGTT CTTCGAGCAT GGACAGCACG CCACCTCGAT CACGGGCGCC ATGGGCTCGG CGGTCGCGGC GTCGCTGGTC TACGGCGCCG ACCGCGGCGC GGTCCTGCAC GCGCTCGGGC TGACCGCGTC CATGGCGTCC GGGGTGATCG AGGCCAACCG CTCCGGCGGC ACGGTCAAGC GCCTGCATTG CGGCCTGGCC GCGCAAGCCG GTGTCACCGC CGCTCAGCTC GTGCGCCGGG GCTTCACCGG CCCGGCCACC GTCTTGGAGG GCCGCTTCGG CTTCTTCCAG GCCTGGTTGC ACGGCCAGTT CTTCCCGGTC GCCGTCACCG AGGGACTCGG CGAGGAGTGG TCGGTGCCCG GCATCTTCTT CAAGCCCTAC CCGGCCAACC ACTTCACCCA CACCACGGTC GACGCCGGAC GCGCCTTCCG CGACCGGGGC GTGCGCCCCG AGGACATCGC GTCGGTCACG ATCGGAGCGC CGACAGCGGT GATCCGCACG ATCGGCCAGC CGATCGACGT CAAGCGCGCA CCGCAGACCG GATACCAGGC CCAGTTCTCG GGACCGTACG CGTTCGTCGC CGGCCTCTTC GGCGGCTCCG GCTTGGGCAC CGGCCTCGAC GACTACACGG ACGCCCTCGC CCAGGACCCC GCCCGCCGCG CGGTGATGGC CAAGGTCGAC GTCGTCCCCG ACGAGCGCTG CGACGGGATC TACCCCTTCC AGTTCCCGGC GGTCGTCACG CTCCGCACCA CGTCGGGCGA GGAGCTGGTC GAGGAGGTCC TCTCGAACCG CGGCGGCCCG GCCCGCCCGT TGAGCGACGA CGAGCTCGCC ACGAAGTTCC GCGACAACGT CGCCGGTCGC CTCGGGGCGG CCGCTGCCGA CGCCGTGCGT CGTAGCGCCC TCGACCTGCG TCATGCCAGT GGCGTCGCCG ACCTGCTGAC CCCCCTTTCG AACCTGGAGG AGAACCGATG A
|
Protein sequence | MAEPASQTLG ERIAEFATDA AANGVPNDVA ASVQQRTLDV LGLCVAAHRL DTSAAIIEHV LDQGGHEQAS LVGRPERVTA AQAALVNGVL AHSLDYDDTH LPSILHPSAS VVPAALAAAE HAGASGELTV RAIAVGLEVA VRLGMAGYDE KLGNSVFFEH GQHATSITGA MGSAVAASLV YGADRGAVLH ALGLTASMAS GVIEANRSGG TVKRLHCGLA AQAGVTAAQL VRRGFTGPAT VLEGRFGFFQ AWLHGQFFPV AVTEGLGEEW SVPGIFFKPY PANHFTHTTV DAGRAFRDRG VRPEDIASVT IGAPTAVIRT IGQPIDVKRA PQTGYQAQFS GPYAFVAGLF GGSGLGTGLD DYTDALAQDP ARRAVMAKVD VVPDERCDGI YPFQFPAVVT LRTTSGEELV EEVLSNRGGP ARPLSDDELA TKFRDNVAGR LGAAAADAVR RSALDLRHAS GVADLLTPLS NLEENR
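The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon, neither of which the record states explicitly. The helper names `expected_gene_length`, `expected_protein_length`, and `gc_percent` are hypothetical.

```python
# Hypothetical helpers for sanity-checking the record; names are illustrative only.


def expected_gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percent G+C in a nucleotide string; whitespace (10-base grouping) is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values copied from the Start bp and End bp fields of the record.
START_BP, END_BP = 4314149, 4315579
gene_length = expected_gene_length(START_BP, END_BP)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1431 476
```

Under these assumptions the computed values agree with the listed Gene Length (1431 bp) and Protein Length (476 aa). `gc_percent` can be applied to the Gene sequence field for comparison with the listed GC content; that result is not asserted here.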