Gene Information |
Locus tag | Dgeo_0219 |
Symbol | |
ID | 4059127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_008025 |
Strand | + |
Start bp | 202560 |
End bp | 203450 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641229219 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_603691 |
Protein GI | 94984327 |
COG category | [R] General function prediction only |
COG ID | [COG1408] Predicted phosphohydrolases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
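The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that Protein Length excludes the stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 202560
end_bp = 203450

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (891 bp) and Protein Length (296 aa).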
| |
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACACGT CCCCCCGGTC TGCCCCGAGT CTCTCCCGCC GCCGCGTTCT GCGTGGGCTG CTGGGGGGCG GCCTGGCTCT GGGCACGCTG GGCGGGGCCG GGCTGGCACA GGCGGCCCAT TTCGAGGTGA CGCGCACGCA GGCTCTGCTG CCCGGTCTGC GCACGCCACT GCGGGTCGCT TTCCTGACCG ATCTGCACTA CGGCCTGTAT GTCTTCGCGG GCAGTGTTCG CGCCTGGGTG AACGCCGCCA ATGCCGAACG CCCCGACCTC ATCTTGCTGG GCGGCGACTT TCTGGACCTT CGCCCGGAGA CCGACCCCGC CCCGCTGCTG GCCGAACTCG CCCGGCTGCG TGCGCCGCTG GGTGTGTATG GCGTATGGGG CAATCACGAC TACGACTCGT TTGGGCGCCG CGCCTCGCGC CGAGGGGGGC AGGCCCGCCC AGACTGGGCG CAGCGGCGGG CTGACTTGAC GGACGCTTTT GCCCGCGCGG GCGTGCGGGT GCTGCTCAAC CGCGGCCAGG CTATCCGGGA TGACCTGTGG GTGGGGGGCG TGGACGACTT CTTGCAGGGC GAGGTCGATG TGCCGGCCGC CCTGGCGGGA GCTGGGGAGC GCGCCACACT GCTGCTGAGC CATAACCCCG ACATCCTCCC CGACCTCCCC GGCCCGGCGG GGCTGGTGCT GTGCGGCCAC ACCCACGGCG GACAGATTCG TCTGCCGCTG ATCGGGGCGC CGGTCGTTCC CAGCCGTTAT GGGCAGCGCT ACGCATTGGG CTGGGTGCGC GGCGCCTACG GCACGCCCGC CTATGTCAGC CGGGGCCTGG GCACCAGCGG CCTGCCCTTG CGCAACCTTT GCCCGCCGGA AGTGACGGTG TTGACGCTGA CACCGGTCTA G
|
Protein sequence | MDTSPRSAPS LSRRRVLRGL LGGGLALGTL GGAGLAQAAH FEVTRTQALL PGLRTPLRVA FLTDLHYGLY VFAGSVRAWV NAANAERPDL ILLGGDFLDL RPETDPAPLL AELARLRAPL GVYGVWGNHD YDSFGRRASR RGGQARPDWA QRRADLTDAF ARAGVRVLLN RGQAIRDDLW VGGVDDFLQG EVDVPAALAG AGERATLLLS HNPDILPDLP GPAGLVLCGH THGGQIRLPL IGAPVVPSRY GQRYALGWVR GAYGTPAYVS RGLGTSGLPL RNLCPPEVTV LTLTPV
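The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the Python below strips that spacing, computes a GC fraction, and translates a nucleotide string codon by codon. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and are not part of any database tool. The codon table is the standard genetic code; translation table 11 assigns the same amino acids to codons and differs only in which codons may act as starts, so the sketch assumes the gene begins with ATG, as this one does.

```python
from itertools import product

# Standard codon table, bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
prefix = "ATGGACACGT CCCCCCGGTC TGCCCCGAGT"
print(translate(prefix))  # MDTSPRSAPS, the first block of the protein sequence
print(round(gc_fraction(prefix), 2))
```

Passing the full Gene sequence string to `gc_fraction` and `translate` allows a comparison against the GC content and Protein sequence fields of this record.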