Gene Noca_1743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1743 
Symbol 
ID4598506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1852621 
End bp1854021 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content71% 
IMG OID639776343 
Productdiaminopimelate decarboxylase 
Protein accessionYP_922943 
Protein GI119715978 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0019] Diaminopimelate decarboxylase 
TIGRFAM ID[TIGR01048] diaminopimelate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.13647 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCACG AAGCCGGCTG GGCGCACGCG CCCGGCGCCC TCCGCGGTCC CGCCTGGCTG 
CAGGCGCCCG CAGATCCGGA CGCCCTGGTC TCGCACCTGT GGTCGTCGAC CGCGCACAAG
GTCGACGGCG TCCTGCACGT CGGCGGCCTG TCGGTCCCCG ACGTCGTCGC GAACGTCAAC
ACCCCGGCCT ACGTCCTCGA CGAAGCCGAC TTCCGGATGC GTGCGCGGGC GTTCCGCGAC
GCATTCGCGG CGTACGACGT CTTCTACGCC GGCAAGGCGT TCCTCTGCAC CACGGTCGCG
CGCTGGGTGC AGGAGGAGGG GCTGTGCCTC GACGTGTGCT CGAACGGCGA GCTCACCGTG
GCGCTGCGGG CCGGCTTCGA TCCGGCCCGG ATCGGCTACC ACGGCAACAA CAAGACCGTC
ACCGAGCTGC GCCGCGCGCT CGCGGCCGGA GTCGGGCGGA TCATCGTCGA TTCCTTCGAC
GAGCTGGCCC GGCTCGAGAT GATCACCGCC GACAGCGGCC TGCCCGCCCG CGTGATGATC
CGGGTGACGG CGGGCGTCGA GGCGCACACC CACGAGTACA TCGCCACCGC GCACGAGGAC
CAGAAGTTCG GGTTCTCGAT CACTTCCGGG GACGCGTTCG AGGCGGCCCG CCGGGCCGAG
GCCGCGCCGG GGATCGACCT GCTCGGCCTG CACTCGCACA TCGGCAGCCA GATCTTCGAC
TCCTCCGGGT TCGAGGTCGC GGCCCGCAGG GTGCTGGCCC TGCAGGCGCG GATCTCCCAG
GAGCTGGGGG TCACGCTGCC CGAGATGGAC CTCGGCGGCG GGTTCGGGAT CGCCTACACG
ACGCAGGACG ACCCGGCCGA ACCCGCGCAG CTGGCACCTG CGATGACCAA GATCGTCGAG
CACGAGTGCC GGGCCCTCGG CATCCCCGAG CCGCGCCTGT CGATCGAGCC CGGTCGCGCG
ATCGTCGGTC CGGCGATGTG CACCGTCTAC ACCGTCGGGA CCGTGAAGCC CGTCCAGCTC
GACGGCGGGG CGGTGCGCAC CTACGTCTCC GTCGACGGCG GGATGAGCGA CAACATCCGC
ACCGCCCTGT ACGACGCGGA CTACTCCTGC ACGCTGGCCT CCCGGCCGTC GACGATCGGC
GGCTCCGTGC TCGCCCGGGT CGTCGGCAAG CACTGCGAGG CCGGTGACAT CGTCGTCAAG
GACGAGTTCC TGCCCGCCGA CGTCCGCCCC GGCGACCTGG TGGCCGTCCC CGGCACCGGC
GCCTACTGTC GCTCGATGGC CTCCAACTAC AACCACGCGC TGCGTCCGCC GGTGGTCGCG
GTGCGGGACG GAGTCGCGCG CGTCGTGGTT CGCCGGGAGA CTGAGGACGA CCTGTTGGCA
ACGGACATGG GTGATCAGTG A
 
Protein sequence
MSHEAGWAHA PGALRGPAWL QAPADPDALV SHLWSSTAHK VDGVLHVGGL SVPDVVANVN 
TPAYVLDEAD FRMRARAFRD AFAAYDVFYA GKAFLCTTVA RWVQEEGLCL DVCSNGELTV
ALRAGFDPAR IGYHGNNKTV TELRRALAAG VGRIIVDSFD ELARLEMITA DSGLPARVMI
RVTAGVEAHT HEYIATAHED QKFGFSITSG DAFEAARRAE AAPGIDLLGL HSHIGSQIFD
SSGFEVAARR VLALQARISQ ELGVTLPEMD LGGGFGIAYT TQDDPAEPAQ LAPAMTKIVE
HECRALGIPE PRLSIEPGRA IVGPAMCTVY TVGTVKPVQL DGGAVRTYVS VDGGMSDNIR
TALYDADYSC TLASRPSTIG GSVLARVVGK HCEAGDIVVK DEFLPADVRP GDLVAVPGTG
AYCRSMASNY NHALRPPVVA VRDGVARVVV RRETEDDLLA TDMGDQ