Gene Msed_1699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1699 
Symbol 
ID5105345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1638424 
End bp1639755 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content49% 
IMG OID640507593 
Productaspartate kinase 
Protein accessionYP_001191778 
Protein GI146304462 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.33468 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.21751 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAGTGG TTAAGATTGG CGGATCAATT CAGAAGGATG AAAGGGACTT CGAACTCATA 
GCCAACAGGA TAAGCCAATA CTCATCGAGG GATAGGACCA TTGTGGTAAC CTCGGCCCTC
AAGGGAGTGA CCAACAGCTT GATTGAAGCC ACGGAGAATA GGGATAAGGC GGTGGAAATA
GTTAGCGAGA TATATGATAG ACACGTCAAG CTCTTGTCCA AGATTGTGGA CGGACCTGAA
TTTGACAATG CGTTCAAGTC CATATCGAAG CTGGCGGACG AGCTATTCAG GATAGCTTGG
TCCATCAAGG TGCTCGATGA GATATCGGCG AGAGTGAGGG ATTACATCCT CTCCTTTGGT
GAAAGGATGG CTAGCGTAAC CTTAGGGGCC ATGTTAAGAA GCAGGAAAAT GGACGCTCAA
TCCTACCCAG AACCATTACT TGTAACGGAT GACTCGTTCG GAGAGGCCAA CGTTATAGAA
GATCTCTCCG CCATGGAGGC AAGGAAGGTC CTAGATATTC CATCTAAGAT TGTTGTGGTT
CCAGGTTTCA TAGGCAAGAC CCCAACGGAG AGATACACTA CTGTTGGGAG GGGAGGAAGC
GACTATACTG CTACTCTTCT CGGTAAGTTA CTAGGATTTC CAGAGGTTAG GCTAGTAACC
GAGGTTCCAG GTATCATGAC TGCAGATCCT AGGAAGTTCC CCGGGGCCAA GACCATATCT
AGGCTCTCCC TAGAGGAGGC CATGGAGCTG GCGCAAATGG GTGCCAAGAG GCTCCATCCA
AGAACTTTCG AGCCCATGTT CGATAGGGAT ATAAGGGTTT ACATAGAGGG GCTCTACGAC
GAGGGTTATA CCCTGGTCCA GGGAACGTGC GACTCGTCAG ATAAATTGAA GGGAATAGCG
GTTCTCGACG ACTTAAAGCT AATCTCCGTG GAGAGCACTA ACATTGTGGG GAAGATAGGT
TCGGCAGCTA GGGTAATGGA AAAGGCTAGA GAGGCGGGAG TTAACATCAT TTCGTTATCT
CAGCCAGCCT CAGAGACCAC CATTCACATC GTGGTTGACT CCAAGAACGC AGAAAGGCTA
TCCTCTCGGC TACAGGAGCT AAGGGATGTG GATAGCATTA ACGTCCAAGA CGCGAGTGCA
GTAAGCGTTG TGGGATGTGG GCTAAGGAAC AAGGAGCTAT TCAGGGAAGT GTTGAGGGAG
GCTTCGTCCT TTGAGGTGGC GTCCATATCC AGGGGACTTA GGAATGTGAG CGCTACATTT
GTGGTGAAAA AAGATGAAGG TTTTAATCTT GCTAAAGACT TACATGAGGT TGTTGTAAAA
TGGATAAACT GA
 
Protein sequence
MLVVKIGGSI QKDERDFELI ANRISQYSSR DRTIVVTSAL KGVTNSLIEA TENRDKAVEI 
VSEIYDRHVK LLSKIVDGPE FDNAFKSISK LADELFRIAW SIKVLDEISA RVRDYILSFG
ERMASVTLGA MLRSRKMDAQ SYPEPLLVTD DSFGEANVIE DLSAMEARKV LDIPSKIVVV
PGFIGKTPTE RYTTVGRGGS DYTATLLGKL LGFPEVRLVT EVPGIMTADP RKFPGAKTIS
RLSLEEAMEL AQMGAKRLHP RTFEPMFDRD IRVYIEGLYD EGYTLVQGTC DSSDKLKGIA
VLDDLKLISV ESTNIVGKIG SAARVMEKAR EAGVNIISLS QPASETTIHI VVDSKNAERL
SSRLQELRDV DSINVQDASA VSVVGCGLRN KELFREVLRE ASSFEVASIS RGLRNVSATF
VVKKDEGFNL AKDLHEVVVK WIN