Gene Noca_1843 details

Gene Information

Locus tag: Noca_1843
Symbol:
ID: 4597163
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 1967130
End bp: 1968479
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 73%
IMG OID: 639776442
Product: MmgE/PrpD family protein
Protein accession: YP_923041
Protein GI: 119716076
COG category: [R] General function prediction only
COG ID: [COG2079] Uncharacterized protein involved in propionate catabolism
TIGRFAM ID:
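
The coordinates and lengths in this record are mutually consistent, and a short arithmetic check makes that explicit. The sketch below assumes inclusive, 1-based coordinates (the record does not state its convention) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record: Start bp and End bp.
length = gene_length(1967130, 1968479)
print(length, protein_length(length))
```

Under these assumptions the check reproduces the 1350 bp gene length and the 449 aa protein length listed in the record.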


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.279739
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGACA TCCTCGACAC CATGACCGAG TGGGCCGCGG AGCTCACCTG GGACGACCTG 
CCCGAGGAGG TCCGGGAACG GGCCGGCTTC GCGCTGACCG ACACAGTGTC CACGATGGTC
GGCGGCGCTC CGACCGCGGC GGCGGTCATC GCGCTCGACT ACGCCGCCAC GGCCGGCGGC
TCCGCGCCGC TGGTCGGGCG GGGGGCCGCG ACCACCCCGG CCAACGCGGC CTTCGGCAAC
GGCGTCGCGG CGAGCGCGCT CGACTTTGAC GACGGGCACT ACCTCGCGGG CGCCATCCAC
CCTGGTTCCG TGATCGTCCC GGCGGTGCTC GCGGTCGCCG ACTCCGTGAC CACGGTCGCC
GACGCACTCG TCGCGCAGGT CGTCGGCTAC GAGATCGGCC TGCGGGCCGC GGCGATGCTC
TGGCCCAAGC ACGACCTGGA CCACTACCAC GCCACCGGCT GCGCCGGAGC GATCGGCGCC
GCGGCCGCGG CCGCCAAGCT GCTCGGGCTG GACGCCGACG GCCTCGCTTG CGCGATCAAG
ATCTCCTGGC TGCACGCACC GATGTCGACC TTCGGCACGC CGATGGTCAA GGAGTCGATC
GGCTGGGGTG CGTCCACGGG CGTCGCGGCG GCACAGCTCG CCGAGGCCGG CTTCATGAAG
GTCCCGGAGG GCTACGACAT CCCGGCCAAC GAGGTGCTCC CGCCGTCGCC GTTCCACCAG
CCAGGCGCGG CCGAGGACCC CTTCGTGACC AGCATCGGCA CCCGCTACGA GGTGCTGCAC
ACCTACTTCA AGTCCTTCGG CGCGTGCCGC TACACGCACG CCGCCGGAGC GGGCCTGCTC
TCCCTGCTCG CCGAGCACGG CATCGCCGCG GCCGACATCG CCCGCATCCG GGTGGGCACG
CACAAGGCGG CGACCTTCCT CGACGAGGTG GCGCCCAGGA CCATCGACAC GGCGCAGTAC
AGCTTCCCGA TCGTGCTCGC CTCGCTCGCT CTGTGGGGCG CTGCTGGAGC CGAGGAGATG
GACGCGTCCC GGCTCGACGA CCCGGAGCGG CTCGCGCTCG CCGGCAAGTT CAGCCTCGAG
CACGACGCCG ACCTCGACCA GCACTACCCG GCGCGCTACC CGAGCCGGGT CGAGGTCGAG
ACGACCGACG GCCGTACCGT TCGCGGCGTC TACCTGGACG GTCCCGGCGA CCCGGGCACC
TCGTTCGGAC CGGCCGAGCT CAGGCAGAAG TGGCAGCGGC TTCTCGGCGC GATGCTCGGC
GAGACGGGCG CCCAGGGTGT GCTGACCGGA CTCGGCGACC CCACGTCGAC GCTGCACGCC
GTCCTGGTGC CCGTGTGGGG GAGCAAGTGA
 
Protein sequence
MTDILDTMTE WAAELTWDDL PEEVRERAGF ALTDTVSTMV GGAPTAAAVI ALDYAATAGG 
SAPLVGRGAA TTPANAAFGN GVAASALDFD DGHYLAGAIH PGSVIVPAVL AVADSVTTVA
DALVAQVVGY EIGLRAAAML WPKHDLDHYH ATGCAGAIGA AAAAAKLLGL DADGLACAIK
ISWLHAPMST FGTPMVKESI GWGASTGVAA AQLAEAGFMK VPEGYDIPAN EVLPPSPFHQ
PGAAEDPFVT SIGTRYEVLH TYFKSFGACR YTHAAGAGLL SLLAEHGIAA ADIARIRVGT
HKAATFLDEV APRTIDTAQY SFPIVLASLA LWGAAGAEEM DASRLDDPER LALAGKFSLE
HDADLDQHYP ARYPSRVEVE TTDGRTVRGV YLDGPGDPGT SFGPAELRQK WQRLLGAMLG
ETGAQGVLTG LGDPTSTLHA VLVPVWGSK
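
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses the standard codon assignments, which translation table 11 shares (table 11 differs from the standard code in its permitted start codons, not in its amino acid assignments). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used as input, so this is a spot check rather than a full verification.

```python
from itertools import product

# Codon table in TCAG order; same amino acid assignments as translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence in this record.
first_line = "ATGACAGACA TCCTCGACAC CATGACCGAG TGGGCCGCGG AGCTCACCTG GGACGACCTG"
last_line = "GTCCTGGTGC CCGTGTGGGG GAGCAAGTGA"

print(translate(first_line))  # MTDILDTMTEWAAELTWDDL
print(translate(last_line))   # VLVPVWGSK
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give the value to compare against the reported GC content.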