Gene Information |
Locus tag | GWCH70_0351 |
Symbol | |
ID | 7977464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 398595 |
End bp | 400550 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644797342 |
Product | group-specific protein |
Protein accession | YP_002948542 |
Protein GI | 239825918 |
COG category | |
COG ID | |
TIGRFAM ID | |
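
The length fields in this record follow from its coordinates. The sketch below reproduces that arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed above but are not stated by the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 398595
END_BP = 400550


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Length in aa, assuming a terminal stop codon that is not translated."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1956
print(protein_length(length_bp))  # 651
```

Both results agree with the Gene Length (1956 bp) and Protein Length (651 aa) fields.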
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00317496 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAAAC TGAACGATTC CGCGCGTCTG AAGGTGAAAA GAGACACGTT TTTTCTCCCT GATTCAAATG GCGGCGTGTA TTTTAGGAAT AACTCCAGCT CATTTCGTAT GAAAGGCAAA ACAATCTATC AGTGGATTGA AAAACTAATG CCAATGTTCA ATGGAGAGCA CACATTGGGA GAATTGACAA AAGGATTATC GGCCCCATAT AGGAACAGGG TGTATGAAAT CGCAGAAATT CTGTACCGAA ACGGCTTTGT TCGGGATGTG AACCAAGATC GTCCGCATCA ATTGGACAGC AAGATTCTCA AAAAGTATGC TTCACAAATT GAATTTATAG AAAGTTTCGT TGATTCCGGT GCGTTCCGTT TTCAAGTATA TCGCCAATCT AAAGTATTGG CAGTCGGCTC TGGACCTTTT TTGGTCTCAT TAGTTTCGGC GTTGATCAAA TCTGGATTGC CTAAGTTCCA TGTTCTGATT ACAGACTCGA TGCCTACTAA TCGACAGCGG TTGAAGGAAC TGGCGGAGCA TGCCCGCAAA ATGGACTCCG AAGTAGCAAT AGAGGAAATA TCTCTACACA GGGGAGCAGG GGAGAGTTCT TGGCGAGAAG TGGTACAGCC ATTTGAGTGG ATTTTATATG TCTCTCAGGA AGGCAATGTA GAGGAACTAC GGGCCCTTCA TGCCGTTTGC AGGGAGGAGA AGAAGGGGTT TCTCCCTGCC ATATCTTTGC AGCAAGTGGG TTTAGCTGGT CCATTAGTGC ATCCGGACTC TGAGGGATGC TGGGAGTCCG CATGGCGCCG CATACACCGA TCCGTGCTTC GCGAAGACCG GCTGGTGCAA GCTTTTTCTG CAACAGCGGG AGCAATGTTG GCGAATGTGA TTGTATTTGA ATTATTTAAA AAAGTTACAG GAGTGACGAA ATCAGAACAG AGAAATCAAT TTTTCCTTCT TGACTTAGAA ACATTGGAAG GTGACTGGCA TTCGTTCATA CCACATCCGC TGGCGACGAC GGAACGTGTG ACGGCTGAAT TGATTCAAGA TCTTGATTCG AGGCTTAAAC AAAATTCGAG CCGCGATGAT TTGAGCAGAT TGTTTCATTA TTTCAGTCAA TTGACATCCT CAAAATCGGG AATTTTCCAT ATATGGGAGG AAAGGAACTT GAATCAGCTT CCGTTGTCCC AGTGCTGCGT TCAAGCGGTT AACCCATTAT CGGAGGGACC GGCTGAGCTA TTGCCAGAAG TTGTCTGTGC AGGTCTAACA CATGAGGAGG CGCGGCGGGA AGCAGGGCTA GCTGGGATTG AATCGTATGT ATCGGGAATG ATAGATTTAC TCGTTAACAC CGAAAAAGAG GTTGGTGTGG TCACACCACA GGAATTTATT GGTGTCGGAG CAGGGGAAAC GATGGCGGAA GGCGTTTGCC GGGGATTGAA AAAGTGCTTG GATGAGGAGC TGAGCAAACG GCAGGTTAAC CGAAGAGAAC CTATTTTTCA GGTACGATTG GGTGCGGTGG AGGATGAACA CTGCAGATTT TATTTGCAGG CACTGGCGAC GATGCATGGA CCACCGACGA TCGGCTTGGG AGAAAAGGTG TCAGGCTTTC CTGTAGTATG GGTCGGCACG GGCGGCCGTT GGTATGGCAG CGCAGGTTTG AATATCACAA TGGCGTTGCG GAAAGCGCTA GAGCAGGCGA TTATGGATGC ACAAAATCAA GCGACTTCTT TCCAAATACA AGCGCTGGAG GAATCATCCA TTTTTCTAAA TGAAGAGAAG CCACTGAGAC TGGAGATCCC CGCTTGTGAA 
GAGACGACGC AATCAGAGCT CTTGCAGTCC GCCATGCAGG TTTTGGAACA AAACCGTATG CGGCTCTTCG TCTTTGATTT AGCGATAGAA CCTTTTTTGA AAGAGGAACT GGCAGGGGTG TTTGGCGTGC TGTTGAGAAA GGAGCATTCC CGTTAG
|
Protein sequence | MSKLNDSARL KVKRDTFFLP DSNGGVYFRN NSSSFRMKGK TIYQWIEKLM PMFNGEHTLG ELTKGLSAPY RNRVYEIAEI LYRNGFVRDV NQDRPHQLDS KILKKYASQI EFIESFVDSG AFRFQVYRQS KVLAVGSGPF LVSLVSALIK SGLPKFHVLI TDSMPTNRQR LKELAEHARK MDSEVAIEEI SLHRGAGESS WREVVQPFEW ILYVSQEGNV EELRALHAVC REEKKGFLPA ISLQQVGLAG PLVHPDSEGC WESAWRRIHR SVLREDRLVQ AFSATAGAML ANVIVFELFK KVTGVTKSEQ RNQFFLLDLE TLEGDWHSFI PHPLATTERV TAELIQDLDS RLKQNSSRDD LSRLFHYFSQ LTSSKSGIFH IWEERNLNQL PLSQCCVQAV NPLSEGPAEL LPEVVCAGLT HEEARREAGL AGIESYVSGM IDLLVNTEKE VGVVTPQEFI GVGAGETMAE GVCRGLKKCL DEELSKRQVN RREPIFQVRL GAVEDEHCRF YLQALATMHG PPTIGLGEKV SGFPVVWVGT GGRWYGSAGL NITMALRKAL EQAIMDAQNQ ATSFQIQALE ESSIFLNEEK PLRLEIPACE ETTQSELLQS AMQVLEQNRM RLFVFDLAIE PFLKEELAGV FGVLLRKEHS R
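
The gene and protein sequences above can be checked against each other with a short script. This is a minimal sketch, not the tool the database used. It builds the codon table from the compact NCBI-style amino acid string; the sense-codon assignments of translation table 11 are the same as in the standard code, so a single table is used here. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. Only the first three 10-base blocks and the final 15 bases of the gene sequence are embedded, to keep the example short.

```python
from itertools import product

# Codon order T, C, A, G at each position, matching the NCBI table layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_blocks = "ATGTCAAAAC TGAACGATTC CGCGCGTCTG"  # first 30 bases of the gene
last_bases = "GAGCATTCC CGTTAG"                     # final 15 bases of the gene

print(translate(first_blocks))  # MSKLNDSARL
print(translate(last_bases))    # EHSR
```

The two outputs match the first block and the last four residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.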