Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2633 |
Symbol | |
ID | 7978296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2667969 |
End bp | 2669687 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644799434 |
Product | hypothetical protein |
Protein accession | YP_002950593 |
Protein GI | 239827969 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1796] DNA polymerase IV (family X) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000221598 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGAA ATAAAAAAGA TGTCATTCGT CTGCTGGAGA CGATTGCGTT ATATATGGAA ATCAAAGGAG AAAATCCGTT TAAAATTGCG GCGTTTCGCA AAGCGGCAAG CGCATTGGAA ACTGATGAAC GAAGCATTGC GGAAATAGAC GATTTCACCG CGATTCCCGG CATCGGAAAG GGGACGGCAA GCGTTATTCA TGAATTTTTG GAAACAGGAA CGTCCAGCGT TCTTGAACAA TTGAAACAGA AAATTCCAGA GAGCTTGCTC ACCCTGCTTC GGCTTCCCGG ACTTGGCGGC AAGAAAATCG CAAAGCTATA TCAAGAACTG GGCATTGTGG ATATTGCTTC ATTGAAAGAA GCTTGCCTCG AGCAGAAAGT ACAGCAGCTT CCAGGCTTCG GAAAGAAAAC GGAAGAAAAG CTTTTAGCCG CCATTGAAGA AATCGGCTCT CGCCCGGAAC GGCTTCCGTT AGCCTTTGTG CTGCCGATTG CCGAGGAAAT AGAAAATCAG CTTAAAAATA TGGAAGGAAT CGTTCGTTTT TCTCGTGCCG GCAGTTTGCG GCGAATGAAG GAAACAGTGA AAGATTTAGA TTTCATTATC GCAACGAACG ATCCGCATCT TGTACGGGAA CATTTATTGA AGCTAGCGAA TGTCTCAGAT GTGATTGCAA ACGGTGATAC GAAAGTGTCC CTAGAGCTTC GCTATGAGTA TGAAATTGCT GTTGATTTTC GTTTAGTGAC ACCGGAGCAG TTCGCCACGA CGCTCCATCA TTTTACGGGA TCGAAAGAAC ATAATGTCCG CATGCGGCAG CTGGCAAAAG AGCGCGGTGA AAAAATTAGT GAGTATGGAG TAGAAAATGT GAAAACGGGA GAAGTGAAAA CATTTTCTGA TGAACAAGCG TTTTTTGCCC ATTTCCAGTT GCCGTTTATT CCGCCGGAAC TAAGGGAAGA TGGGACCGAA GTCGACCGCT ATCGCGACGA CTATTCGCTT CTTTGCCTTT CCCACATTCA AGGAGATTTG CATATGCACT CTACTTGGAG CGACGGAGCG TGCTCGATCG AGGAAATGGC GGAAGCATGC CGGAAAAAAG GCTACCGCTA TATGGCGATT ACCGATCATT CTCAATATTT AAAGGTCGCC AACGGGCTGA CGGTCGAACG GTTAAAGCGG CAGCGCGAAG AAATTGAACG GTTAAACGCG AAATATGATG ATTTTACAAT TTTGGCTGGA ATAGAAATGG ATATTTTGCC AGATGGGACG CTTGATTACG ATGATGGCGT TCTCGAAGAA CTTGACTTTG TCATCGCTGC GATTCATTCA AGTTTTTCCC AGTCGCGTGA CGTGATTATG AAGCGTCTTG CTGCTGCGCT TCGCAATCGT CATGTTGATT TGATCGCTCA TCCGACAGGG CGGTTAATCG GAAAGCGAGA CGGATATGAC GTAGATATAG ACATGCTTAT CGAATTGGCG CGGGAAACGA ATACGGCGCT TGAGTTAAAT GCGAATCCGA ACCGTCTCGA TTTGTCGTAT TCCTATTTGA AAAAAGCGCA AGATGCTGGA GTAAAAATCG CGATTAATAC AGATGCCCAC CATTTGGACA TGCTTGACCA TATGGAAATA GGGGTCATAA CCGCACGAAA AGGATGGATT CGCAAGGAAA CGGTGATCAA TACATGGTCT CTTGAAGAGT TGCGCAGCTT TTTGCAACGC AATCGGTAA
|
Protein sequence | MKGNKKDVIR LLETIALYME IKGENPFKIA AFRKAASALE TDERSIAEID DFTAIPGIGK GTASVIHEFL ETGTSSVLEQ LKQKIPESLL TLLRLPGLGG KKIAKLYQEL GIVDIASLKE ACLEQKVQQL PGFGKKTEEK LLAAIEEIGS RPERLPLAFV LPIAEEIENQ LKNMEGIVRF SRAGSLRRMK ETVKDLDFII ATNDPHLVRE HLLKLANVSD VIANGDTKVS LELRYEYEIA VDFRLVTPEQ FATTLHHFTG SKEHNVRMRQ LAKERGEKIS EYGVENVKTG EVKTFSDEQA FFAHFQLPFI PPELREDGTE VDRYRDDYSL LCLSHIQGDL HMHSTWSDGA CSIEEMAEAC RKKGYRYMAI TDHSQYLKVA NGLTVERLKR QREEIERLNA KYDDFTILAG IEMDILPDGT LDYDDGVLEE LDFVIAAIHS SFSQSRDVIM KRLAAALRNR HVDLIAHPTG RLIGKRDGYD VDIDMLIELA RETNTALELN ANPNRLDLSY SYLKKAQDAG VKIAINTDAH HLDMLDHMEI GVITARKGWI RKETVINTWS LEELRSFLQR NR
|
| |