Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1101 |
Symbol | |
ID | 7977589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1152269 |
End bp | 1153153 |
Gene Length | 885 bp |
Protein Length | 294 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644798054 |
Product | DNA protecting protein DprA |
Protein accession | YP_002949227 |
Protein GI | 239826603 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00287455 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATTCAT CCGTTCGTGA GCGCCTTATT CATCTTCACC ACTGCCGCGG GGCAGGCTGG AAAACGATCC ATCGCTTATT GGAAATCGAT CCGACCTTTT CCTCTGTATT TACCCTTCCA TCTTCTGCAT TGCGAACGCG TATTCCTTTA TCCTCCCAAC AATATACACA ATTTTTCCAA GATTTACATT CCCCTCATAT TGAAAGTATG ATAAAAACAT ATAAGGACAA AAATATCCAC GTCATCACAA TTTTTGACTC AGACTATCCA CCTTTGTTAA AACATATATA CAAGCCCCCA TGGGTGCTAT ATGCAAAAGG GAATATTCGC CTTTTGTCGT CATTTAAAAT GATTAGTATT GTTGGGACGA GACAGCCGAC AAAAGAAGGG ATACAATCGT TGCGGCAACT AGTTCCGCCG CTGGTGGCGA ACGATTGGGT AATCGTAAGC GGTCTTGCGG TAGGAATCGA TACGTTAGCG CACGAAATGA CAATTGAAAA CGGTGGACAT ACAATCGCGA TCATCGCAGG AGGATTTGAA CATATATATC CAAGACAAAA TAAAAGACTA GCAGACCAAT TAATGGGCGG ACACCTTATT TTATCGGAAC ATCCGCCGCA TGTGCGGCCG CAAAAATGGC ATTTTCCGCT GCGCAATCGG ATTATCAGCG GAATATCGCT TGGAACGATC GTCGTACAAG CGAAAGAGAG AAGCGGCTCA TTAATTACTG CTCTATTAGC GCTTGAGCAA GGAAGAGAGG TGTTTGCCGT TCCTGGTCCA ATTTTTTTGG AACAATCAAA AGGGCCAAAC ATGTTAATAC AGCAAGGAGC AAAATTAGTT CATTCGGCAA CCGATATATT CGAGGAATTT TCCTACATTC GATAA
|
Protein sequence | MYSSVRERLI HLHHCRGAGW KTIHRLLEID PTFSSVFTLP SSALRTRIPL SSQQYTQFFQ DLHSPHIESM IKTYKDKNIH VITIFDSDYP PLLKHIYKPP WVLYAKGNIR LLSSFKMISI VGTRQPTKEG IQSLRQLVPP LVANDWVIVS GLAVGIDTLA HEMTIENGGH TIAIIAGGFE HIYPRQNKRL ADQLMGGHLI LSEHPPHVRP QKWHFPLRNR IISGISLGTI VVQAKERSGS LITALLALEQ GREVFAVPGP IFLEQSKGPN MLIQQGAKLV HSATDIFEEF SYIR
|
| |