Gene Msed_1414 details


Gene Information

Locus tag: Msed_1414
Symbol:
ID: 5104624
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1380888
End bp: 1381994
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 49%
IMG OID: 640507303
Product: D-proline dehydrogenase
Protein accession: YP_001191496
Protein GI: 146304180
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0665] Glycine/D-amino acid oxidases (deaminating)
TIGRFAM ID:
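
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The constant and function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1380888
END_BP = 1381994
REPORTED_GENE_LENGTH = 1107      # bp
REPORTED_PROTEIN_LENGTH = 368    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp, length_bp == REPORTED_GENE_LENGTH)
    print(protein_length(length_bp), protein_length(length_bp) == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions the reported gene length and protein length are consistent with the coordinates: 1107 bp is 369 codons, of which the last is the stop codon.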


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000836219
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0027626
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGGTAG GAATTGTGGG TGGCGGTATT GTTGGGTTAA TGTCAGCCTA TTTTCTAGCT 
AAGGAGGGAG TCTCCGTCAC GGTATATGAC CCTGCTCCTG GTAAGTACTC TATTCATGCA
GCAGGGCTGA TAGAGCCATA CCGTTTTGAC AGGATTAACA CAACCTCCAT GATAGCGAAG
ATGTTACGTT TCATGAGGAG GGGGGTCACA GAGGTAAGGC AACTTAATAA AATGTGGGTA
GTCGAGCTTC TTTCCTCCCT AAACAAGTCA CCCCCTCAGG AGGCATGGGA CCTAATGAGG
GAAATGGCGA GACTATCCCT GGACACTTAC GCCCAAATGG CAGAGGAAAG AAACGATTTC
GATTATCATA ACGACGGTCT CCTAGAGGTT TATACAAGTG AGGAAGAGCT GGAGAAGGGA
GAGAAGGAGG AGAAACAGAG TCCCTTTTCG CCCAAGTTCG AGGTGACCGA AGTTCCAGGG
TTTGCTGGAG GAATATTCTT TCCAGAGCTG AGCCGAATCG CAACCGAGAA GTTCGTGAAA
AGGATAACAC GAGAGCTAAC CCAGCTGAAG GTCAATTTTC AGGGAATGGA GGCTCAACCC
AATCTTAAGG ACTACACCTT GAATGGTGAG AAATTCGATG TTGTGATCCT GGCCAACGGA
GTGTGGATCA CCAAGTCCTT GAAGTTGCCA ATTACCGCGT TTAAGGGCTA TGGGGCATGG
GTTAAGGGTA GTTCAAAGAT AAAGAACGCG TTCGTAACCG TGGACGAAGG CGTTGCAGTC
TCTCCGTTAT CTGACCACGT CAAGATTACA GGTGGATTCT CAGCTGATTA CGGAAGCGAA
TGGAGGACAG ATATCCTGTC TAAGGTCACA AGCCTTGTAA AGGTGGAGGA GGTAATGGAG
AGGAACATGG GTTTCAGACC TTGCTCGCCG GACGGTTTTC CTATAATGGG CAGGCTGGAT
AACGTTGTGG TTGCAACTGG AGCATGCAGG TTAGGGTGGA GTTATGCCCC AGCTATGGGC
TATTACGCCA GCGAACTGGC GCTAGGGAAG AGGAGCACAC TCGGATACGT TTCAAGGTAC
GTTGACAGGT TACGCTCTAG CGAGTAA
 
Protein sequence
MKVGIVGGGI VGLMSAYFLA KEGVSVTVYD PAPGKYSIHA AGLIEPYRFD RINTTSMIAK 
MLRFMRRGVT EVRQLNKMWV VELLSSLNKS PPQEAWDLMR EMARLSLDTY AQMAEERNDF
DYHNDGLLEV YTSEEELEKG EKEEKQSPFS PKFEVTEVPG FAGGIFFPEL SRIATEKFVK
RITRELTQLK VNFQGMEAQP NLKDYTLNGE KFDVVILANG VWITKSLKLP ITAFKGYGAW
VKGSSKIKNA FVTVDEGVAV SPLSDHVKIT GGFSADYGSE WRTDILSKVT SLVKVEEVME
RNMGFRPCSP DGFPIMGRLD NVVVATGACR LGWSYAPAMG YYASELALGK RSTLGYVSRY
VDRLRSSE
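
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 60 nucleotides of the gene. This is a minimal sketch, not the database's own pipeline: it builds the codon table by hand, relying on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ only in permitted start codons). The function and variable names are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. The amino acid
# assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence in this record.
first_line = "ATGAAGGTAG GAATTGTGGG TGGCGGTATT GTTGGGTTAA TGTCAGCCTA TTTTCTAGCT"
# Last line of the gene sequence, which ends in the stop codon TAA.
last_line = "GTTGACAGGT TACGCTCTAG CGAGTAA"

if __name__ == "__main__":
    print(translate(first_line))  # MKVGIVGGGIVGLMSAYFLA
    print(translate(last_line))   # VDRLRSSE
```

The two outputs match the first 20 residues and the final 8 residues of the protein sequence listed above. The last line happens to start in frame because the preceding 1080 bases are a multiple of 3.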