Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1572 |
Symbol | |
ID | 7976226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1646078 |
End bp | 1647211 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644798463 |
Product | PBS lyase HEAT domain protein repeat-containing protein |
Protein accession | YP_002949635 |
Protein GI | 239827011 |
COG category | [C] Energy production and conversion |
COG ID | [COG1413] FOG: HEAT repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0581284 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTGAAAA TTAAATCGAT TGAACCAACC CCGAGTCCAA ATACGATGAA AGTATTGTTA GACGAAGAAC TGCCGTTTGG CACCAGCCAT AACTATAAAC CGGACAATGT CGATACGGCG CCTCCTCTTA TTCAGCAGTT AATGAAAATT GAAGGAGTAA AAGGAATTTA TCATGTCGCC GACTTTTTGG CCGTTGAACG CAATCCGAAA TATGATTGGA AAGAGATTTT AACGAAAGTG CGTGAAGTGT TTGGGGAAGA AGTCGACAGC GAGCAAGAAG AAACGAAAAA AACGAATGAA CATTTTGGAG AAGTAAAAGT ATATGTACAA ATGTTGTACG GTTTGCCGAT GCAAGTGAAA TTAACGGATG GGGAACGGGA GCATCGCGTC GGCCTTCCAA AACGGTTTAT CGATGCGGTC ATTGAAGCGC AGAAATATGC CGACAATATC GTGTTAGAAC GGAAATGGGT TGAAAAAGGG GTTCGTTACG GCACATTTGA AGAAATCGGC AATGAAATTG TCGAAGAACT GTCAGCGGCG TACCCGCCGG AACGATTAGA ACGCATGGTA CAAATGTTCC GCCGCGGCGA GCAGGCAAAA ACGCAAAAAC GGCAAAGCAT CAAAGTAACA GAAGAAATGC TCGATGATCC GGATTGGACA AAGCGATATG CGGCGTTAGA ACAAATGGCG GAGCCGACAG AAGACGATAT ACCGGTGCTC GCAAAAGCGC TGAAAGACGA AAAAGTAGCG ATTCGCCGCT TGGCTACTGC GTATTTAGGG ATGATCGGCG GCAAAAAAGT ATTGCCGTAT TTATATGAAG CGCTGAAAGA TAAAGCGGTT TCCGTGCGGC GGACGGCGGG AGACTGCTTG TCCGATATTG GAGATCCGGA AGCCATTCCA GTGATGATTG AGGCGCTGAA AGACCCAAGC AAGCTTGTCC GTTGGCGCGC TGCCATGTTT TTATACGAAG TCGGCGATGA ATCGGCATTG CCGGCATTAA AGGCGGCGGA AAACGATCCG GAATTTGAAG TGAGCATGCA AGTGAAAATG GCGATCGAGC GCATTGAAGG CGGCGAGGAA GCGAAAGGAT CGGTTTGGAA ACAAATGACG GAAAGCAGAA GAAAAGGTCA ATAG
|
Protein sequence | MLKIKSIEPT PSPNTMKVLL DEELPFGTSH NYKPDNVDTA PPLIQQLMKI EGVKGIYHVA DFLAVERNPK YDWKEILTKV REVFGEEVDS EQEETKKTNE HFGEVKVYVQ MLYGLPMQVK LTDGEREHRV GLPKRFIDAV IEAQKYADNI VLERKWVEKG VRYGTFEEIG NEIVEELSAA YPPERLERMV QMFRRGEQAK TQKRQSIKVT EEMLDDPDWT KRYAALEQMA EPTEDDIPVL AKALKDEKVA IRRLATAYLG MIGGKKVLPY LYEALKDKAV SVRRTAGDCL SDIGDPEAIP VMIEALKDPS KLVRWRAAMF LYEVGDESAL PALKAAENDP EFEVSMQVKM AIERIEGGEE AKGSVWKQMT ESRRKGQ
|
| |