Gene Rcas_3854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3854 
Symbol 
ID5541358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5037528 
End bp5038532 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content60% 
IMG OID640895964 
Productsuccinylglutamate desuccinylase/aspartoacylase 
Protein accessionYP_001433909 
Protein GI156743780 
COG category[R] General function prediction only 
COG ID[COG3608] Predicted deacylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.795222 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCAA CGCAGATCGT GACCATGATC GATTTCGAGA AGCCCGGCAA GCAAGTGGGA 
CAGTTGGCGA TTCCACAATC AACCAACACA TCGGGATGGG CAACCGAATA TCTGCCCATT
GCCGTGATCA ATGGCGCACC CGGACCAACG GCGCTGTTGT TCGGCGGCAA CCATGGCGAT
GAGTATGAAG GACCGGTGAC GCTCCTGAAC ATGGCGCACA CCCTCGAACC AGACAATCTC
TGCGGGCGTG TCATCATCGT TCCGATGCTG AACCGTCCGG CGCTTGCCGC CGGAACCCGA
CTGTCGCCGC TCGACGGCAA GAACATGAAT CGGGTGTTTC CGGGACGCGC CGATGGAACG
ATCACCGAGA TGATTGCTCA TTACGTAACG ACGGTGCTTT TCCCACTGGC GGACCTGGTG
ATCGATATTC ATTCCGGCGG ACGGTCGGCG CACTTTCTGC CGCTCGTCAG CATGCACCAT
GTTCCCAACC ACGAACAATT GCGATCCATG ATCGATCTGG CGCTGGCATG GGGTGCGCCA
TATGTGCTGC TCTACCGGGA TGTCGGCGGA ACCGGGCTGC TCCCCGGCGA AGCCGAACGC
CTGGGAAAAC TGACCCTTGG CACTGAAATG GGCAGCGCCG CGCAGTTTGG CGTCGATATG
CTCAGCCTGA CGGAACGCGG CGTGCGCAAT GTGTTGCGTC AGGCACGCAT CCTGACCGAT
CAGACGCCCG ACCCGCCGGC GCCCGCGAAG ATTATGGCTG CCGATCAGTA TGATGATTAC
ATCATGGCGC CGGTAAGCGG CATCTTCGAA CCTTTCGTCG AAATGGGCGC GTGGATGGTC
GCGGGACAGG CGATCGGGCA GATCCATTCT ATCGAGCAAC CCTTCGCTTT GCCAACGCTG
GTGTACGCCA GAACAGACGG CATGCTGATC AGCCGGCGCG CGTTTCCCCT CGTTCGCCAG
GGCGACTGCC TTGCCACACT TGCACGCCCG TTTCACCTGC CATAG
 
Protein sequence
MSATQIVTMI DFEKPGKQVG QLAIPQSTNT SGWATEYLPI AVINGAPGPT ALLFGGNHGD 
EYEGPVTLLN MAHTLEPDNL CGRVIIVPML NRPALAAGTR LSPLDGKNMN RVFPGRADGT
ITEMIAHYVT TVLFPLADLV IDIHSGGRSA HFLPLVSMHH VPNHEQLRSM IDLALAWGAP
YVLLYRDVGG TGLLPGEAER LGKLTLGTEM GSAAQFGVDM LSLTERGVRN VLRQARILTD
QTPDPPAPAK IMAADQYDDY IMAPVSGIFE PFVEMGAWMV AGQAIGQIHS IEQPFALPTL
VYARTDGMLI SRRAFPLVRQ GDCLATLARP FHLP