Gene Msed_1991 details

Gene Information

Locus tag: Msed_1991
Symbol:
ID: 5103378
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1927753
End bp: 1928598
Gene Length: 846 bp
Protein Length: 281 aa
Translation table: 11
GC content: 47%
IMG OID: 640507879
Product: L-2-aminoadipate N-acetyltransferase
Protein accession: YP_001192055
Protein GI: 146304739
COG category: [H] Coenzyme transport and metabolism
              [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0189] Glutathione synthase/Ribosomal protein S6 modification enzyme (glutaminyl transferase)
TIGRFAM ID: [TIGR00768] alpha-L-glutamate ligases, RimK family
            [TIGR02144] Lysine biosynthesis enzyme LysX
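
The start and end coordinates, gene length, and protein length listed above are mutually consistent if the coordinates are 1-based and inclusive (an assumption; the record does not state the convention). A minimal Python sketch of that arithmetic follows; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(1927753, 1928598)
print(length)                     # 846
print(protein_length_aa(length))  # 281
```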


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.317152
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGTAG CCCTGATTGT AGACATAGTT AGGCAAGAGG AAAAGCTTCT AGTTAAGGCG 
CTCAACGATA AGCAGGTTAG TTACGACGTA ATTAACGTGG CTCAAGAGCC ATTGCCGTTT
AACAGGGCGT TGGGTCGATA CGACGTAGCC ATAATTAGGG CAGTTAGCAT GTATAGGTCA
CTATACGCCG CTGCCGTCCT GGAGGGAACC GGGGTTCATA CGATTAACTC CACAGACGTG
ATAAGCGTAG CTGGAGACAA GATATTGACA TACTCCAAGC TCTTCAGGGC AGGAATACCC
GTTCCACAGT CCATCATAGC CATGTCACCG GACTCGGTGA TGAAGGCATA TGAACAGATT
GGATTCCCCC TCATTGACAA ACCACCAATT GGTAGCTGGG GAAGGATGGT CTCCCTCATC
AGGGATATCA TCGAGGGGAA GACCATAATA GAACACAGGG AAATGATGGG GAATTCCGCA
CTGAAGGTTC ACATAGTCCA AGAATACATT ACAGGGAAAA ACAGGGACAT AAGATGTATA
GTGATGGGGA ATGAGCTCCT AGGATGCTAC GCCAGGAACA TACCCTCTAA CGAATGGAGG
GCTAACGTTG CCCTTGGAGG AACTCCAACT CCTCTGGAGG TTGACGACGC GCTTAAGGAA
ACAGTGTTGA AGGCAGTTAA GGTGATAAAT GGCGAGTTTG TCTCAATTGA CGTGTTGGAA
CATCAGTCTA GGGGATACGT TATTAACGAG CTTAATGACG TTCCTGAGTT CAAGGGGTTC
ATGCTTGCCA CAGGGATAGA TGTGCCCAAC AGGCTGGTAG ATTACATAAA GGAAAAGTTT
TCCTAA
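
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be joined before any length or GC calculation. The sketch below shows one way to do that in Python. The helper names are hypothetical, and the example runs only on the first two blocks of the sequence rather than the full gene, so it does not by itself reproduce the 47% GC figure in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence above.
fragment = clean_sequence("ATGAAGGTAG CCCTGATTGT")
print(len(fragment))         # 20
print(gc_content(fragment))  # 0.45
```

Passing the full gene sequence through `clean_sequence` and then `gc_content` gives the whole-gene value.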
 
Protein sequence
MKVALIVDIV RQEEKLLVKA LNDKQVSYDV INVAQEPLPF NRALGRYDVA IIRAVSMYRS 
LYAAAVLEGT GVHTINSTDV ISVAGDKILT YSKLFRAGIP VPQSIIAMSP DSVMKAYEQI
GFPLIDKPPI GSWGRMVSLI RDIIEGKTII EHREMMGNSA LKVHIVQEYI TGKNRDIRCI
VMGNELLGCY ARNIPSNEWR ANVALGGTPT PLEVDDALKE TVLKAVKVIN GEFVSIDVLE
HQSRGYVINE LNDVPEFKGF MLATGIDVPN RLVDYIKEKF S
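
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a minimal Python illustration, not the database's own method. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (standard amino acid assignments).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence: matches the first 10 residues.
print(translate("ATGAAGGTAG CCCTGATTGT AGACATAGTT"))  # MKVALIVDIV
# Last 15 bases of the gene sequence: matches the last 4 residues, then stop.
print(translate("GAAAAGTTT TCCTAA"))  # EKFS
```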