Gene Msed_1665 details


Gene Information

Locus tag: Msed_1665
Symbol:
ID: 5104870
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1607505
End bp: 1608995
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 50%
IMG OID: 640507559
Product: glycine dehydrogenase subunit 2
Protein accession: YP_001191744
Protein GI: 146304428
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain
TIGRFAM ID:
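
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1607505
end_bp = 1608995
gene_length_bp = 1491
protein_length_aa = 496

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp. Assumption: the last codon is a stop
# codon, which is not translated into an amino acid.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp, span_bp % 3, coding_codons)  # prints: 1491 0 496
```

Under these assumptions the span matches the stated gene length, is a whole number of codons, and yields the stated protein length.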


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.00835735
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGAGGC AGGCGTATTG GGATGAACCT CTAATCACGG AGTATAAGGG GAAGGGAAGA 
CAAGGCTTCC TCGTACCTAA GGAAGACCTA GACGTGGAAA TTAAGTTACC TGAAAAGATC
AAGAGGGGAA AGGAGCCTGA ACTTCCCGAG GTTTCGGAGC TTGAGGTCGT TAGGCACTTC
GTGAGACTAT CCCAGATGAG CTTTGGCGTT GACACTGGAA TGATGCCTTT GGGATCATGC
ACCATGAAGT ACAATCCAAA GATAGAGGAA GAGACAGGGG TTGCAGATAG GACTCACCCA
TTACAGGATC AGGACACTGT TCAGGGGAAC CTAGAGGTAA TGTACGAAAT GCAAAGGTGG
CTTGCTGAGG CAACGGGAAT GGACGAATGT AGCTTACAGG TTCCGGCGGG ATCAGCTGGC
GAACTGGCTG GCGTGCTCAT GATCAGGAAA TACCACAGGG ATCAGAATAG GAGGAGGGAG
GAGATGCTTG TTGCTGACTC AGCCCACGGA ACAAATCCGG CAAGCGCAGC AATGGCTGGC
TTCTCAGTGA TCTACATCAA GTCTAACCAG GAGGGCTTGG TTGACCTCAA CGTGCTCAAA
GGGACAATAT CAGATAACGT TGCAGGGTTC ATGTTAACTA ATCCTAATAC CTTGGGACTC
TTTGAGGAAA ACATCAAGGA GATAGCTGAG CTGGTTCACT CGGTAGACGG AGTCCTCTAC
TATGATGGCG CTAACCTAAA CGGAATCCTG GGAATAGTGA GACCAGGGGA CATGGGATTT
GATATAGTTC ATCTCAATCT TCACAAGACC TTCGGAGTCC CCCATGGGGG TGGAGGTCCA
GGGGCTGGAG CCGTGTGTGC CAAGGGTAAG ATGACTAAGT ATCTCCCGTA CCCCATAGTG
TCCAAGGGAG AGAGGTACTA TCTTGTTAAG CCTGAGAGGT CCATAGGGAA GATCTCGGTG
TTTAACGGAA ACTTTGGTAA CCTGATGAGG TCCTATGCCT ACATTCTTGG CCTTGGCGGA
AAGGGAGTGT CCATGATTGG AAGAATGAGT ACATTGGCCA CAAACTACCT GATAGCGAAA
CTTAGAGGAG TGAGGGGACT GGAGTTGATG GCTCCTCACC GGTTCAGAAA ACATGAGGTA
GTATTCAGCG CAAAGAAACT GGCAGAGGAA ACTGGGGTGA CTGCGTTTGA TATAGCCAAG
GCTTTACTCG ACAGGGGCTT CTATGCGCCC ACCATATACT TCCCGCCCAA TGTGGAGGAG
GCTCTGATGA TCGAGCCCAC AGAAACTGAG CCCATAGAAG TCCTGGATCA GTACGCCAAC
GCAATCAAGG ATATTGTGGA GAAAGCATAT TCCAACCCTT CCTCCATTAC TTCGGCTCCC
CAAAACACGT CAGTGGGTAG ACTTGATCAG GTTAAGGCAA ATCATCCGAG CACTATGACC
CCAACCTATA GGGTTCTCAA GTCTAGGTTA GCGAGCCAAG GAAGAAAGTA G
 
Protein sequence
MWRQAYWDEP LITEYKGKGR QGFLVPKEDL DVEIKLPEKI KRGKEPELPE VSELEVVRHF 
VRLSQMSFGV DTGMMPLGSC TMKYNPKIEE ETGVADRTHP LQDQDTVQGN LEVMYEMQRW
LAEATGMDEC SLQVPAGSAG ELAGVLMIRK YHRDQNRRRE EMLVADSAHG TNPASAAMAG
FSVIYIKSNQ EGLVDLNVLK GTISDNVAGF MLTNPNTLGL FEENIKEIAE LVHSVDGVLY
YDGANLNGIL GIVRPGDMGF DIVHLNLHKT FGVPHGGGGP GAGAVCAKGK MTKYLPYPIV
SKGERYYLVK PERSIGKISV FNGNFGNLMR SYAYILGLGG KGVSMIGRMS TLATNYLIAK
LRGVRGLELM APHRFRKHEV VFSAKKLAEE TGVTAFDIAK ALLDRGFYAP TIYFPPNVEE
ALMIEPTETE PIEVLDQYAN AIKDIVEKAY SNPSSITSAP QNTSVGRLDQ VKANHPSTMT
PTYRVLKSRL ASQGRK
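
The sequences above are printed in space-separated blocks of 10 residues, so they need to be joined before any length or composition check. The sketch below is one hedged way to do that in plain Python; the helper names `clean_sequence` and `gc_fraction` are hypothetical and do not come from the record or from any library. It is applied here only to the first line of the gene sequence as an example.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the residues."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (0.0 if empty)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied as printed (blocks of 10 bases).
first_line = "ATGTGGAGGC AGGCGTATTG GGATGAACCT CTAATCACGG AGTATAAGGG GAAGGGAAGA "

seq = clean_sequence(first_line)
print(len(seq))                    # number of bases on that line
print(round(gc_fraction(seq), 2))  # GC fraction of that line only
```

Passing the full gene or protein block to `clean_sequence` and taking `len()` of the result gives a length that can be compared with the Gene Length and Protein Length fields; `gc_fraction` on the full gene sequence can be compared with the GC content field.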