Gene Information |
Locus tag | Msed_1414 |
Symbol | |
ID | 5104624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1380888 |
End bp | 1381994 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640507303 |
Product | D-proline dehydrogenase |
Protein accession | YP_001191496 |
Protein GI | 146304180 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | |
|
|
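The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the reported gene length includes the terminal stop codon; both assumptions are consistent with the listed values but are not stated in the record. The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information table; nothing here is new data.
START_BP = 1380888
END_BP = 1381994
GENE_LENGTH_BP = 1107
PROTEIN_LENGTH_AA = 368


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive start and end coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the values reported in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the coordinate span matches the 1107 bp gene length, and 1107 bp corresponds to 369 codons, that is, 368 residues plus one stop codon.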
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00000836219 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0027626 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGGTAG GAATTGTGGG TGGCGGTATT GTTGGGTTAA TGTCAGCCTA TTTTCTAGCT AAGGAGGGAG TCTCCGTCAC GGTATATGAC CCTGCTCCTG GTAAGTACTC TATTCATGCA GCAGGGCTGA TAGAGCCATA CCGTTTTGAC AGGATTAACA CAACCTCCAT GATAGCGAAG ATGTTACGTT TCATGAGGAG GGGGGTCACA GAGGTAAGGC AACTTAATAA AATGTGGGTA GTCGAGCTTC TTTCCTCCCT AAACAAGTCA CCCCCTCAGG AGGCATGGGA CCTAATGAGG GAAATGGCGA GACTATCCCT GGACACTTAC GCCCAAATGG CAGAGGAAAG AAACGATTTC GATTATCATA ACGACGGTCT CCTAGAGGTT TATACAAGTG AGGAAGAGCT GGAGAAGGGA GAGAAGGAGG AGAAACAGAG TCCCTTTTCG CCCAAGTTCG AGGTGACCGA AGTTCCAGGG TTTGCTGGAG GAATATTCTT TCCAGAGCTG AGCCGAATCG CAACCGAGAA GTTCGTGAAA AGGATAACAC GAGAGCTAAC CCAGCTGAAG GTCAATTTTC AGGGAATGGA GGCTCAACCC AATCTTAAGG ACTACACCTT GAATGGTGAG AAATTCGATG TTGTGATCCT GGCCAACGGA GTGTGGATCA CCAAGTCCTT GAAGTTGCCA ATTACCGCGT TTAAGGGCTA TGGGGCATGG GTTAAGGGTA GTTCAAAGAT AAAGAACGCG TTCGTAACCG TGGACGAAGG CGTTGCAGTC TCTCCGTTAT CTGACCACGT CAAGATTACA GGTGGATTCT CAGCTGATTA CGGAAGCGAA TGGAGGACAG ATATCCTGTC TAAGGTCACA AGCCTTGTAA AGGTGGAGGA GGTAATGGAG AGGAACATGG GTTTCAGACC TTGCTCGCCG GACGGTTTTC CTATAATGGG CAGGCTGGAT AACGTTGTGG TTGCAACTGG AGCATGCAGG TTAGGGTGGA GTTATGCCCC AGCTATGGGC TATTACGCCA GCGAACTGGC GCTAGGGAAG AGGAGCACAC TCGGATACGT TTCAAGGTAC GTTGACAGGT TACGCTCTAG CGAGTAA
|
Protein sequence | MKVGIVGGGI VGLMSAYFLA KEGVSVTVYD PAPGKYSIHA AGLIEPYRFD RINTTSMIAK MLRFMRRGVT EVRQLNKMWV VELLSSLNKS PPQEAWDLMR EMARLSLDTY AQMAEERNDF DYHNDGLLEV YTSEEELEKG EKEEKQSPFS PKFEVTEVPG FAGGIFFPEL SRIATEKFVK RITRELTQLK VNFQGMEAQP NLKDYTLNGE KFDVVILANG VWITKSLKLP ITAFKGYGAW VKGSSKIKNA FVTVDEGVAV SPLSDHVKIT GGFSADYGSE WRTDILSKVT SLVKVEEVME RNMGFRPCSP DGFPIMGRLD NVVVATGACR LGWSYAPAMG YYASELALGK RSTLGYVSRY VDRLRSSE