Gene Sare_4034 details

Gene Information

Locus tag: Sare_4034
Symbol:
ID: 5705014
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4590526
End bp: 4591938
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 70%
IMG OID: 641273459
Product: diaminopimelate decarboxylase
Protein accession: YP_001538815
Protein GI: 159039562
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0019] Diaminopimelate decarboxylase
TIGRFAM ID: [TIGR01048] diaminopimelate decarboxylase
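
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS is complete with a terminal stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration; they are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming a complete CDS whose last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(4590526, 4591938)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1413 470"
```

Both results match the Gene Length (1413 bp) and Protein Length (470 aa) fields listed in the record.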


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.84121
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00505776
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGCGTGCGC ACGAGGCCGG TGCCCTGCAC GGGGAGATCG GCAGCCGTGG GCCGGCGTGG 
CTGCGTGCCC CGCTCGACGT GAACGCCCTG GTGCCGGGGC TGTGGCCGCG CACCGTCAGC
CGTGCGGCGG ACGGTGCGCT CGCGGTCGCG GGTTGTTCGG TGCGGGACCT CGCCGCCGAG
TTCGGCACGC CGGTCTACGT GTTGGACGAG GAGGACCTGC GGACCCGCTG TCGGGACTTC
CGTGCCGCTT TTCCGACCGA GGACGTCTAC TACGCCGGTA AGGCGTTCCT CTGCCGCGCG
GTGGTTCGGA TGATCGCCGA GGAGGGCCTG CACCTGGACG TGTGCAGCGG TGGCGAGTTG
GCCACCGCGC TGGCGGCCGG GATGCCGCCG GAGCGGATCG GCTTCCATGG CAACAACAAG
TCGGTCGCCG AGCTGAGCCG GGCGCTGGAC GCCGGGGTGG GCCGGATCAT CGTCGACTCG
TCCCACGAGA TCGACCGACT AACCGGGTTG GCTCGCGAGC GGGGCGTCCG CCCACGGGTA
CTGGTGCGGG TCACCGTCGG CGTGGAGGCG CACACCCACG AGTTCATCGC CACCGCACAC
GAGGACCAGA AGTTCGGCTT TTCGCTGGCG GGAGGCGCGG CGATCGCCGC CGTACTGCGC
ATCCTCGACG AGGACGTGTT GGAGTTACGT GGTCTGCACT CGCACATCGG ATCGCAGATC
TTCGACGCGA GCGGCTTCGA GGTCTCCGCC CGCCGGGTGC TGGCGCTCCA GGCGCAGATC
CGCGACGCGC GGGGAGTGCA GTTGCCCGAG CTGGACCTGG GTGGCGGCTT CGGCATCGCG
TACACGACAC AGGACGACCC GGCCACGCCG GCCGATCTGG CGAAGCGGCT ACGGAAGATC
GTCGACGGGG AGTGCGCCGC CGAGCGGCTG GCCGTGCCGC ACCTGTCCAT CGAGCCGGGC
CGGGCGATCG TCGGTCCGGC CATGTTCACG CTCTACGAGG TGGGCACGGT CAAGTCCGTG
CCGGTCGGCG CCGGTGGGGA CACCGCTGAC GGGCACCGCT GCTATGTGAG CGTCGACGGC
GGAATGAGTG ACAACATCCG GACCGCGCTC TACGACGCGT CCTACTCGGC GACGGTGGCC
TCCCGGGCGA GCGGTGCCGC GCCGGTGCTC GCCCGCGTGG TGGGAAAGCA TTGTGAGTCC
GGGGACATCG TGGTGAAGGA TGAGTTCCTG CCCGCCGACG TGCAGCCCGG AGATCTTGTC
GCGGTGCCCG GTACAGGTGC GTACTGCCGG AGCATGGCCA GCAACTACAA CCACGTGCTG
CGCCCGCCGG TGGTCGCGGT GCGCGACGGT CAGGCCCGCC TGATCGTCCG CCGGGAAACC
GAAGAGGATC TGCTCGCTTT GGATGTGGGA TGA
 
Protein sequence
MRAHEAGALH GEIGSRGPAW LRAPLDVNAL VPGLWPRTVS RAADGALAVA GCSVRDLAAE 
FGTPVYVLDE EDLRTRCRDF RAAFPTEDVY YAGKAFLCRA VVRMIAEEGL HLDVCSGGEL
ATALAAGMPP ERIGFHGNNK SVAELSRALD AGVGRIIVDS SHEIDRLTGL ARERGVRPRV
LVRVTVGVEA HTHEFIATAH EDQKFGFSLA GGAAIAAVLR ILDEDVLELR GLHSHIGSQI
FDASGFEVSA RRVLALQAQI RDARGVQLPE LDLGGGFGIA YTTQDDPATP ADLAKRLRKI
VDGECAAERL AVPHLSIEPG RAIVGPAMFT LYEVGTVKSV PVGAGGDTAD GHRCYVSVDG
GMSDNIRTAL YDASYSATVA SRASGAAPVL ARVVGKHCES GDIVVKDEFL PADVQPGDLV
AVPGTGAYCR SMASNYNHVL RPPVVAVRDG QARLIVRRET EEDLLALDVG