Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1004 |
Symbol | |
ID | 7976791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 1053710 |
End bp | 1054915 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644797957 |
Product | hypothetical protein |
Protein accession | YP_002949130 |
Protein GI | 239826506 |
COG category | [R] General function prediction only |
COG ID | [COG1323] Predicted nucleotidyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCGG TCGGAATCAT TGTCGAATAT AATCCGTTTC ATAACGGGCA TTTATATCAC TTAGAAGAAA CCAAAAAACA AACTGGAGCA GATTGCATTA TTGCCGTCAT GAGCGGAAAT TTTCTGCAGC GCGGAGAACC TGCGCTCGTA TCAAAATGGG CGCGAACAAA AATGGCATTG TCAGCCGGGG TAGATATTGT CATTGAGCTG CCGTATGCAT TTGCGGTTCA ATCTGCTGAA CAATTTGCCA GCGGCGCGGT TACATTGCTT CATTCCTTAT TTTGTGAAGA AATTTGCTTC GGAAGCGAAA ATGGAAATAT AACCGCATTT ATCGATGCGG CAAAAACGTT TTTAGAGCAA AAACAGCAGC ATGACAGTTA TGTGCAAGAA GCGCTTCAGG AAGGGGTAAG CTATCCGCGG GCGAACGCAG AAGCCTGGAA ACGATTGAAC GCCACTAACC TTGATTTATC TAAGCCGAAC AATGTGCTCG GTCTTGCGTA TGTCAAAGCG ATTTTACAAA AACAAATTCC GATCACTCCT CGTACGATTC GCCGCATCGC TTCCGATTAT CATGACAAAA CGTTCTCTCA TCCATCGATC GCTAGCGCGA CAAGCTTAAG AAAAGCGCTT AAAGGGTCGC TTGCACATTT AGAAACCATC GCCCCGTATA TTCCAGGCAC TACAAAACAA ACACTCGAGC AATATTATGA TACATACGGC ATGTTTCATG AATGGGAAGC CTATTTTCCG TTTTTAAAGT ATCGCATCAT GACAGCGGAA GAGGCGGAGC TGCGCCAAAT TGCCGGCGTT GACGAAGGTA TCGAACACCG CTTAAAGCAA GAAATTGTTG CTGCCCCGAC GTTTTCCGCT TTTATGAACT CGATCAAAAC AAAACGATAT ACGTGGACGA GGCTGCAACG AATATGCACT CATATTTTAA CGAATTTTAC AAAAGACCAG CGAAAAAAAA CAGAAACGCC GACATATATT CGTTTGCTTG GAATGAGCAG CAACGGGCGC CGCTATCTAC AACATGTAAA AAAACATCTG CCGCTTCCGC TCGTTACAAA AGTGTCAAAT TTAAAACATG ACCCGATTTA CCAGCAAGAA AAAAAGGCGT CTTTTGCTTA TGCCGCAATA TTTCCTGAAC CTGCTCGCAC CAATGTGCTG AAAGAGGAGT ACGCTACTCC TCCTCTTTTG CAATAA
|
Protein sequence | MKAVGIIVEY NPFHNGHLYH LEETKKQTGA DCIIAVMSGN FLQRGEPALV SKWARTKMAL SAGVDIVIEL PYAFAVQSAE QFASGAVTLL HSLFCEEICF GSENGNITAF IDAAKTFLEQ KQQHDSYVQE ALQEGVSYPR ANAEAWKRLN ATNLDLSKPN NVLGLAYVKA ILQKQIPITP RTIRRIASDY HDKTFSHPSI ASATSLRKAL KGSLAHLETI APYIPGTTKQ TLEQYYDTYG MFHEWEAYFP FLKYRIMTAE EAELRQIAGV DEGIEHRLKQ EIVAAPTFSA FMNSIKTKRY TWTRLQRICT HILTNFTKDQ RKKTETPTYI RLLGMSSNGR RYLQHVKKHL PLPLVTKVSN LKHDPIYQQE KKASFAYAAI FPEPARTNVL KEEYATPPLL Q
|
| |