Gene Msed_1823 details


Gene Information

Locus tag: Msed_1823
Symbol:
ID: 5105386
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1769843
End bp: 1770952
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 55%
IMG OID: 640507722
Product: succinyl-diaminopimelate desuccinylase
Protein accession: YP_001191901
Protein GI: 146304585
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID: [TIGR01910] acetylornithine deacetylase or succinyl-diaminopimelate desuccinylase


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.248504
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCTTC TAAGGGAGCT CGTGGAGATA GAGACCGTGA ACCCTCCCGG GTCTCATTAC 
GAGGAATTTA CGTCGGTGAT GAGGGAGAGA CTTGGGGAAC TGGGATTTCA GGTAGAGCTC
GTGGAGATTC CAGACGAGTT CCTGGACAAG AACTACATTT ATTCCCCTAG GCACAGGGGG
AACAAGAGGG TCATACTCCT CGCGAGGAAT GACCCTGAAC CCAGGCTTCA CTTCAACTTC
CATTACGACG TGGTTCCTGC AGGGAACGGT TGGGTGACTG ATCCCTTCAA GCTGAAGGTA
GTTGAGGACA GGGCATACGG GAGAGGGACT TCTGACATGA AGGGGGCCAT CGCGAGCCTT
TACCTCGCGT TATCGGGTCA GGACTTCCCC GTAGAGGTTG CGCTTGTACC TGATGAGGAG
AGCGGAGGGC TAGGAACCAG GTACCTTGTG GATAAACTTC GGGTCAGGCC GAGACACGTG
ATCCTAGGCG AGCCGAGCTT CCCCGACCTG TACGTGGGTC ATTTCGGGAT CGTCCGGGGA
GTTGTGAGGG TGTTCGGGAA ACAGGTCCAC GCCAGCATGG CAAACCAAGG AGTTAACGCC
TTCCTTGAGG CCTCTAGGTT AGCCCTGGAG CTTCAGAGGA GGTACTCCTC GCTCTCGCTC
TCGCTCGAGG GATCAACGGT GCTCGGCGGA TACGTCGAGG GTTCAACGAG CGACGGGATG
GTTCCAGGGA CATTCGCCTT CAGTTTCTAT AGGTCAGTCC CACCAAAGGG AAGGGGTCCG
GACCTCGATC ATGAGATCGT GGACGAGACG GCCAGGGAAC TGGGGATCAA GCACGAGTTC
GAGATTAAGT CCTTCGTACC GGGTTCAATG ACCAGTCCTG ATTCCAGCTT GACGAGAGTC
GTCGAGGCGT GTATTAGGGA GATGGGCTGG GAACCTAGGA AGGAGGTGGC GAAGATTAGA
TATGACGCGG TATTCTACGG AGATATTGAC GCCGTGAACT TCGGCCCAGG GGAGCCGGGG
CAGGCCCACG TTGCGAATGA GTATGTTGAC CTTAGAAACG TAAAAAGGGT AAGCCAAGTA
TATAGTTGCG TGATGAGATC CATGTTGTAG
 
Protein sequence
MNLLRELVEI ETVNPPGSHY EEFTSVMRER LGELGFQVEL VEIPDEFLDK NYIYSPRHRG 
NKRVILLARN DPEPRLHFNF HYDVVPAGNG WVTDPFKLKV VEDRAYGRGT SDMKGAIASL
YLALSGQDFP VEVALVPDEE SGGLGTRYLV DKLRVRPRHV ILGEPSFPDL YVGHFGIVRG
VVRVFGKQVH ASMANQGVNA FLEASRLALE LQRRYSSLSL SLEGSTVLGG YVEGSTSDGM
VPGTFAFSFY RSVPPKGRGP DLDHEIVDET ARELGIKHEF EIKSFVPGSM TSPDSSLTRV
VEACIREMGW EPRKEVAKIR YDAVFYGDID AVNFGPGEPG QAHVANEYVD LRNVKRVSQV
YSCVMRSML
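
Consistency check (illustrative)

The coordinates, lengths and sequences in this record can be cross-checked with a few lines of Python. The sketch below is a minimal illustration, not part of the database record. The names translate, CODON_TABLE, gene_start and protein_start are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 is assumed to share for sense codons. Only the first 60 bases of the gene sequence are used here.

```python
# Illustrative sketch; all names here are hypothetical helpers, not database fields.
BASES = "TCAG"
# Standard codon-to-amino-acid assignments in TCAG order (assumed valid for table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Values copied from the record above.
start_bp, end_bp = 1769843, 1770952
gene_length = end_bp - start_bp + 1      # coordinates are inclusive
protein_length = gene_length // 3 - 1    # final codon is the stop codon

# First line of the gene sequence and the first 20 residues of the protein.
gene_start = "ATGAACCTTC TAAGGGAGCT CGTGGAGATA GAGACCGTGA ACCCTCCCGG GTCTCATTAC"
protein_start = "MNLLRELVEI ETVNPPGSHY"

print(gene_length, protein_length)
print(translate(gene_start) == protein_start.replace(" ", ""))
```

Passing the full gene sequence to translate in place of gene_start would extend the same comparison to all 369 residues.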