Gene Rcas_1034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1034 
Symbol 
ID5538500 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1350031 
End bp1351377 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content59% 
IMG OID640893173 
Productdiaminopimelate decarboxylase 
Protein accessionYP_001431156 
Protein GI156741027 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0019] Diaminopimelate decarboxylase 
TIGRFAM ID[TIGR01048] diaminopimelate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCACA CGACTCAAGA AGGCATAAAC AACGCACACC TCTGGCCCCT GACGACGGTA 
GTCGATATGC ATGGGCGACT GATGATCGGC GGCTGCAATG TGGCGACTCT GGCGCGTCAG
TACGGCACGC CGCTCTACCT GTTCGATGAA GAGACCATTC GCAGCGCCTG CCGCACCTGG
CGAACGGCGC TTGCAGCCTG TTACCGGGGG GAGACGGCAG TTCATTACGC GAGTAAAGCG
TTGCTCAACA CTGCGCTCAC ACACCTGATC GCTGATGAAG GGCTGGGGCT GGACGTAGTC
TCAGGCGGTG AGTTATACGT GGCGCTCCGC GCCGGCTTCC CGCCGCAGCG CATTCATATG
CACGGCAACG CCAAAACGCG CGCCGAACTG GAACAGGCGC TGGCCTCCGG AATCGGACAG
ATCATTGTCG ATAATCTCGA TGAACTGGCG ATGTTGGCGA ACCTGACCGC ATATCGTTCA
CCACCACAAC CGATTGCGTT GCGCATTGCA CCGGATATCG TCACCAATAC GCACGCCCAT
ATTCAAACCG GTCACGCGAC ATCGAAGTTC GGTCTACCAC TTGATGCACT CGATGCCGCC
GCCGAACGGT TACGCACTGC GCCCGGTCTG TGCCTGATCG GGTTACACGC TCATCTCGGG
TCGCAACTCT TCGACCTGGA ACCATATGCC GCTGAGATCG ATACGCTGCT CGACAGCGCC
TCGCGCCTGC GTGATCGCCA CGGTTTCATT ATTCAGCAAA TCAACATCGG CGGAGGAGCA
GGAGTGCCAT ACACTGCGGA TCAGCACCCC TTCGATGTAC ACGCTCTTGC GATGAGATTG
GGAGAAGCGC TCACCGATGA ATGCCGCCGA CGCGGGTTTC CCCTGCTGCA CCTGGTGATC
GAACCAGGAC GTTCAATCAT CGCGCGCGCA GGGGTAGCGC TCTATACGAT TATCGCAACA
AAGAATCTTC CGCATATGCG ATTCCTACAT ATCGACGGCG GCATGGGCGA CAATATTCGT
CCGGCGCTCT ACGGCGCGCG GTATAGTGCG GTGCTGGCAG AACGGGCGAA TGCGCCAATC
GAAGAGAGCG TAGCGATTAC CGGGCGCTAC TGCGAATCGG GCGATGTGTT GATCCATGCC
GCACCGCTCC CGCGCGCCAG CGTTGGCGAC ATTCTGGCAG TTCCTGTGGC GGGCGCCTAC
ACGCTGAGCA TGGCCAGCAC ATACAACCTG ACTCCACGTC CGGCGGTTGT CATGGTGAAT
GGTGGGTCAG TACGTCTCAT TCAGCGCCGC GAAACGTATG AGGATATGAT TGCCAGGGAT
GTGGTGTCGT CGCAGGGGCA GGTCTGA
 
Protein sequence
MNHTTQEGIN NAHLWPLTTV VDMHGRLMIG GCNVATLARQ YGTPLYLFDE ETIRSACRTW 
RTALAACYRG ETAVHYASKA LLNTALTHLI ADEGLGLDVV SGGELYVALR AGFPPQRIHM
HGNAKTRAEL EQALASGIGQ IIVDNLDELA MLANLTAYRS PPQPIALRIA PDIVTNTHAH
IQTGHATSKF GLPLDALDAA AERLRTAPGL CLIGLHAHLG SQLFDLEPYA AEIDTLLDSA
SRLRDRHGFI IQQINIGGGA GVPYTADQHP FDVHALAMRL GEALTDECRR RGFPLLHLVI
EPGRSIIARA GVALYTIIAT KNLPHMRFLH IDGGMGDNIR PALYGARYSA VLAERANAPI
EESVAITGRY CESGDVLIHA APLPRASVGD ILAVPVAGAY TLSMASTYNL TPRPAVVMVN
GGSVRLIQRR ETYEDMIARD VVSSQGQV