Gene Noca_0040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_0040 
Symbol 
ID4600091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp45212 
End bp46573 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content68% 
IMG OID639774655 
ProductMmgE/PrpD family protein 
Protein accessionYP_921277 
Protein GI119714312 
COG category[R] General function prediction only 
COG ID[COG2079] Uncharacterized protein involved in propionate catabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGACTG TCTTGAGCAC CTACACCGAG CAGCTGGCCG AGTACGTCAC CGCCACGACC 
TACGACGCGC TGCCCACCTC GACCGTGGCC GCCGCCAAGC GCGTGACGCT CGACCTGATC
GGCGTCGTCC TGCCGGCCAT CAACTACGGC CCCGGCAGCG TGATGAACCA GTACGTGCGG
GAGACCGGCG GCCCCGGTCA GGCCACCGTC GTCGGCACCG ACATCAAGAC GAACGCCGCC
AACGCCGCCC TCGCCAACGG GACGATGGCG GCGGACATGG AGCAGGACGA CGTCCACCCC
GAGTCGAACC TGCACGCGAG CAGCGTCTTC GTCCCGGCGA TGCTGGGCGT TGCCGAGGAG
CTCGGTTCCT CGGGCCGCGA CTGGATCAAC GCCCTGGCCG TCGCCTACGA CGTCGGCTGC
CGGATCTCCA TCGCGATGGA CAACGGCCGG CAGTACGCGA GCGGCTTCCA CCCGACGGCA
GTCTCCGGCA CCTTCGGTGC CGCGGCCGCG GTGGCACGGC TCCTCGGCCT CGACGCCGCC
GGTGTCAACA GCACCATCGG TCTCACCGGC TGCCAGGCGG CCGGCATGCT CACCTGGGAG
ATGGAGACCG AGCACTACAC CAAGTCCTTC CAGAGTGGGG TTCCGGCGCG CAACGCGGTC
GTGGCCGCGC AGCTCGCCGC CCGGGGCTAC GTCGGCGCCA GCAACACCCT CGACGGGAAG
TACAACGTCT TCGACGCGTT CTCCAACCAC CGGAACTTCT CGCGGCTGGT GGAGAACCTC
GGCGACCGCC ACGAGATCGA GTACACCGGG TACAAGTTCT ACTCGGTGTG CCGCTTCATC
CACTCAGCCA TCGACCAGTT GCTCGATCTG TCCGCGGAAC ACGGTTTCGC GGGCGCCGAC
ATCGAGAGCC TCGACGTCTG GCTGCCGCAC ACGCAGGTGC CGATCGTCGA CCACAACACG
CTGATCACCC ACAACCTCCA GTACTCGCTC GCGGTGGGTA TCACCGACCG GGTCGTCGAG
CGCGCACAGA CCTCGAACGA GCGCTTCGCG GACCCCGCGC TGCAGGCGAT CGCGGCGAAG
GTGACCCTTC GCGGGGCCGA CGACCTGGAG GCCCTCTACC CCGCCCACTG GCCCTCGCGC
GTGCACATCC GCCTCACCGA CGGCCGGACC TTCGACAGCG AGAAGCACGA CCCGCGGGGC
ACCTCGTTCG TACCGGTGAC CGACGCTGAC ATCGTCGCGA AGTTCGAGGG CATGGCCTCC
CAGGTCCTGC CCGCAGAGCG GGTCAACCAG ATCGTCAAGA TCGTCGACGA GCTCGAGACC
CTCGACTCCA TCCGCGAGCT AACGGCCCTG CTGGTGCCGT GA
 
Protein sequence
MRTVLSTYTE QLAEYVTATT YDALPTSTVA AAKRVTLDLI GVVLPAINYG PGSVMNQYVR 
ETGGPGQATV VGTDIKTNAA NAALANGTMA ADMEQDDVHP ESNLHASSVF VPAMLGVAEE
LGSSGRDWIN ALAVAYDVGC RISIAMDNGR QYASGFHPTA VSGTFGAAAA VARLLGLDAA
GVNSTIGLTG CQAAGMLTWE METEHYTKSF QSGVPARNAV VAAQLAARGY VGASNTLDGK
YNVFDAFSNH RNFSRLVENL GDRHEIEYTG YKFYSVCRFI HSAIDQLLDL SAEHGFAGAD
IESLDVWLPH TQVPIVDHNT LITHNLQYSL AVGITDRVVE RAQTSNERFA DPALQAIAAK
VTLRGADDLE ALYPAHWPSR VHIRLTDGRT FDSEKHDPRG TSFVPVTDAD IVAKFEGMAS
QVLPAERVNQ IVKIVDELET LDSIRELTAL LVP