Gene Information |
Locus tag | GWCH70_1909 |
Symbol | |
ID | 7978737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 1961707 |
End bp | 1962972 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644798740 |
Product | hypothetical protein |
Protein accession | YP_002949910 |
Protein GI | 239827286 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3359] Predicted exonuclease |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000148245 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAATGA AGCAGAAATT AGCAAGATGG AAGGAACAAC TTGCATCCCG GGCTTCCGTT CAGGAAGAAC GGCCCGATGT GCTCTTTGGA GAGCAACAAG AGAAAGAGGT TCCATTTCTT GATGAATGGC AGAAAAAACA TGTGCAGCCG TTCTTTTTTG ATGGGGATTA TTGTTTGATT CGCGAAGTAG TATATCCGTT AGATTATCAG CATGGACGAT ATCGACTTGG AGAGTTTCAT CATATTCATG CCCGTTGGCA AGACGCTTCT TTTACACATC CGCTTTCGAG CAAAGGGCAT GAAGCGAGCG ATTTATTTTT CTTTGATACG GAGACGACCG GGCTTAGCGG CGGAACGGGA CATGTCATTT TTTTGCTTGG CCATGCCCGC GTATATGAAG ATCGGGTTGT TGTCCGCCAG CATTTTTTGC CGCACCCGGG AGCGGAAGTG GCATTGTATC AAAGTTTTTT ATCGGAAGTC GACTATACAA CGCTTGTTAC GTATAACGGA AAAGCATTCG ATTGGCCGAA AGTCAAGACG CGCCATACGC TGATTCGTGA TGCTGTCCCG AAACTGCCGG CGTTTGGCCA TTTCGATTTA TATCACGCAT CAAGGAGAAT GTGGAAACAA AAGCTTGAGT CTGTTCGCCT TTCCAATGTC GAAAAAGAGA TATTGCAAAT TGAGCGAGAA GAAGATGTTC CCGGTTTTTT GGCGCCGATG ATGTATATGG ACTTTCTATC GGCGCCGCAT CCTGATCGAA TTTTTCCGGT ATTTCTCCAT AATGAACTTG ATGTTTTATC GTTAATTTGC CTCTATATTC ATTTATCGAA ACAGCTGCTA GAAGCGCCGC AACTAAAAGA TGCATTGGAA CAGTTGGAAA CAGCTCGTTG GCTCGAGACG TTAGGAGAAA CAAATGCCGC GAAAAACGTG TATGAGCGCG TGATCGAAAA AGAAACAAAA GAATCGTGGC AGGCCAAATG GCAACTATCG CTGTTATATA AAAAAGAAAA ACGGTACGAA AAAGCAGTGG ACATATGGAA AGAATTATGG CAGCATGGCA GTGATACGTG GAAGATGAAA GCCGGGGTTG AATTGGCAAA AGCGTATGAA CATTATTTTC GTGATGCCCA TATGGCGCAT CACTATGCGA TCAACGTATA TGAACGATGG AAAACACTAT CTCGTTCCTA TAAACAGCGG AATACTACAC AAGAGTTAGA GTTGATCAGG CGTATAGAAC GGCTTCAGCG GAAATTAAAT CATTAA
|
Protein sequence | MSMKQKLARW KEQLASRASV QEERPDVLFG EQQEKEVPFL DEWQKKHVQP FFFDGDYCLI REVVYPLDYQ HGRYRLGEFH HIHARWQDAS FTHPLSSKGH EASDLFFFDT ETTGLSGGTG HVIFLLGHAR VYEDRVVVRQ HFLPHPGAEV ALYQSFLSEV DYTTLVTYNG KAFDWPKVKT RHTLIRDAVP KLPAFGHFDL YHASRRMWKQ KLESVRLSNV EKEILQIERE EDVPGFLAPM MYMDFLSAPH PDRIFPVFLH NELDVLSLIC LYIHLSKQLL EAPQLKDALE QLETARWLET LGETNAAKNV YERVIEKETK ESWQAKWQLS LLYKKEKRYE KAVDIWKELW QHGSDTWKMK AGVELAKAYE HYFRDAHMAH HYAINVYERW KTLSRSYKQR NTTQELELIR RIERLQRKLN H
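The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene sequence ends in a stop codon (the listed sequence ends in TAA). The helper names `gene_length`, `protein_length`, and `gc_percent` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string, ignoring the spaces between 10-base blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


length = gene_length(1961707, 1962972)
print(length)                  # 1266, matches "Gene Length"
print(protein_length(length))  # 421, matches "Protein Length"
```

Passing the full gene sequence string from the record to `gc_percent` would give a value to compare with the listed GC content; the result is not stated here because it was not computed by hand.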