Gene Information |
Locus tag | Dgeo_2280 |
Symbol | |
ID | 4059229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_008025 |
Strand | + |
Start bp | 2402370 |
End bp | 2403545 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641231330 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_605743 |
Protein GI | 94986379 |
COG category | [S] Function unknown |
COG ID | [COG4102] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
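The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is not part of the record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 2402370
end_bp = 2403545

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated,
# so the protein is one residue shorter than the codon count.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1176
print(protein_length_aa)  # 391
```

Both results agree with the Gene Length (1176 bp) and Protein Length (391 aa) rows.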
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000359779 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCTCG ATCGCCGTGA CTTCCTGAAA TACTCCGCCC TCGCCGTCGC CGCAACCAGC GGCATGCCGG GCTTTCTCGC CCGTGCCGCC ACCCAGGCGA GCGGCACCCG GACGCTGGTC GTGATCCAGC TCACCGGGGG CAATGACGGA CTCAACACTC TGATTCCCTA CTCCAACGGC GCCTATTACG CCGCGCGGCC CAACATCGCC ATTCCCAAAA AGGACGTGCT GACCCTCACC CCCGACCTCG GCATGCATCC TGCGCTCAAG CCGCTGATGC GCCTGTGGGA TGCTGGGCAG CTCGCCTGGA TGGAGAATGT CGGCTACCCC AACCCCAACC GCAGCCATTT TGCGAGCATG GCGATCTGGC ACACCGCCGA CCCCATGCAG GCGCAGGCAG AGGGCTGGAT TGGTCGCATC GCGGAAAAGA TCGGTGATCC CTTTTGCGCG TCGAATCTGG GCAGCGTGAC ACCGCTGGCC CTGCAGGCAG CTGACTTTAG CCTGCCCAGC ATCGACAGCG TGGACAACTT TCAGGTGAAG CTCCCGGCAG GGCTAGACGG TGCCTTCCAG GCCCTGCTGA ACACCGCGCG CAGCGGCGAG GCGGCCTACC TCCAGCGCGC CACCCGGCAG ATGCTCGCCA ACACGCAGAA GGTGCAGCAA AACGTCTCGA AGTACCGCCC AGGTGCTCAG TATCCTGAGG GCCGGTTCGC CGCGCAGTTG CAGGACGCGG CCCGGCTGAT TGCGGCGGGA ACTGGACAGC GGGTGCTGTA CGTGACCCTG GGCAACTTTG ACACCCACGC CGGACAGCGC GCTGAACAGG ACGAACTCCT GGGGCAGCTC GCCGCGGGCC TCGCGGCGTT CCAGGCCGAT CTGGAGGCGC AGGGCCTCGC AGAGCGGGTG ATGGTGATGG GCTTTTCCGA GTTCGGGCGG CGAGTGGCCG AGAACGCCAG CGCGGGCACC GACCACGGCA AGGGCAGCGT GATGTTCGCC CTGGGCCGAG GCGTCAAGGG CGGCATCCAC GGCGATAGCC CCGACCTGGA AAACCTGTCC GACGGGGACA TCCAGTACAA GCAGGATTTC CGCGGCGTGT ATGCGGAGGC GCTGACCAAA TGGTTGGGAC TGGACGCCCG GGAGATCCTG CGAGGCGACT TCCAGGGACC GGGATGGGTG GCCTGA
|
Protein sequence | MPLDRRDFLK YSALAVAATS GMPGFLARAA TQASGTRTLV VIQLTGGNDG LNTLIPYSNG AYYAARPNIA IPKKDVLTLT PDLGMHPALK PLMRLWDAGQ LAWMENVGYP NPNRSHFASM AIWHTADPMQ AQAEGWIGRI AEKIGDPFCA SNLGSVTPLA LQAADFSLPS IDSVDNFQVK LPAGLDGAFQ ALLNTARSGE AAYLQRATRQ MLANTQKVQQ NVSKYRPGAQ YPEGRFAAQL QDAARLIAAG TGQRVLYVTL GNFDTHAGQR AEQDELLGQL AAGLAAFQAD LEAQGLAERV MVMGFSEFGR RVAENASAGT DHGKGSVMFA LGRGVKGGIH GDSPDLENLS DGDIQYKQDF RGVYAEALTK WLGLDAREIL RGDFQGPGWV A
|
| |
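The Product and TIGRFAM rows annotate this protein as carrying a Tat (twin-arginine translocation) pathway signal. As a rough illustration, the sketch below looks for the core of the commonly cited Tat consensus motif (S/T-R-R-x-F-L-K) in the first ten residues of the protein sequence above. It is a simple pattern match under that assumption, not the method TIGRFAM uses to assign TIGR01409, and the variable names are hypothetical.

```python
import re

# First block of ten residues copied from the Protein sequence row.
n_terminus = "MPLDRRDFLK"

# Assumption: use only the R-R-x-F-L-K core of the consensus;
# "." matches any single residue in the x position.
tat_core = re.compile(r"RR.FLK")

match = tat_core.search(n_terminus)

# Report the 1-based start position of the motif, or None if absent.
motif_start = match.start() + 1 if match else None
motif_text = match.group() if match else None

print(motif_start, motif_text)  # 5 RRDFLK
```

The twin arginines sit at residues 5 and 6, close to the N-terminus, which is where a signal sequence would be expected.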