Gene Information |
Locus tag | GWCH70_3120 |
Symbol | |
ID | 7976765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 3142217 |
End bp | 3144100 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644799906 |
Product | S-layer domain protein |
Protein accession | YP_002951045 |
Protein GI | 239828421 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4193] Beta-N-acetylglucosaminidase |
TIGRFAM ID | |
|
|
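The coordinate and length fields in the table above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the stop codon; the record itself does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 3142217
END_BP = 3144100


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len, protein_length_aa(gene_len))  # 1884 627
```

Under these assumptions the result agrees with the listed Gene Length (1884 bp) and Protein Length (627 aa).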
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGGTTAC TTGTCAGTCT TTTTCTAGTT TTTTGCTTAT TTTTTGCCAA TATTTCCGTT TCATTCGCAG ATGATATTAC AGGAATTGCG CTTGAAAAAG AAATGCGCGC GATGGTGAAT CAAGGAGTCG TTCAAGGATA TCCTGATGGC AAATATCGTC CGAACGAATC TGTCACTCGC GGCCAGTTTG CTACGTTTGT CGCCAGAGCA TTGCAGCTTC CGAAAGGATC GGGGAACTTT TCCGACGTGT CACCATCTTC CGAACTAGCC GACGGCATTT ACAAAGCAAG CGCGGCAGGG ATCGTTCAAG GGTATTCGAA CGGCACGTTT GGCGTGAATA ATAAAATTAC AAGAGAAGAA ATGGCAATAA TGATCGACCG AACGCTAGAT TATTTAGGAA TAGAGAAAAA GCAGGCATCG CTTGACCATT TTACCGATGT GGATGGGCTC TATTCGGCTT CGAAAATCGC AATCTCTCAT AATGTGTATT ATGGCATTAT TCGCGGAATT CCAAACACGG ATGGAACCAC GTTCCGATTT GCTCCGAAAG CGTACGCGAC AAGAGCGCAC GCCGCTGCCT TTCTTTACCG CCTGCTTGAA GTGTGGGCGG AGCAGACGCC GGAAATGGCG TATCAAATTG CTGAGATTCA AAACGGACAA TTAACGCTGC TGCCAAAGCG CTATGCCACA TTTGCACAGG CGGAAGCGTC TGTTACGAAT TGGTCTTCTC AAGTGGTGGT GCAAGGCACC AAAATTGTTA AGATGGCTAG CGGAAACGTT ATTGCCCAGC CGTCTCCTGG AAAATCAACC ACAATAATCT ATCAATCTAA TTTAAGTACG AGTCTTACGT ATGTCGCGCC AAATACAGAA ATGAAATATT TAGATGCCGA TGAAGAAAAA GTAAAAGTGC AAATCGCCAA TACGATCGGG TATGTGAAAC AGTCGGAAGT GATGCTAGTG CCAACGGCGT TGCTGCAAGG CCGGTCGTAT TACATTGCGC AAAAAGGGGA GTTGTATCAT TACATTTACA ATACGACAAG CAACAAATAC GTAAGCTATC TGTACGGCAA AGCGCCGTCG TTTATGCAAG ATGGGCAAAA GTATTACAGC TGGGATGGCG AAACGTTTTA TAACACAGCA GGCAAGCCCA TCGGCACGGC GTATCAATAC TTTAACGTCC TGCCAATCCG TACGAAAACA AATTATACGG CCGAAGAATT GAATCGGTTT GTCGCGGCAA ACCGTTCTGA CAGCCCATTA AAAACATTAG GAGAAGCCTT TAAAAAGACG GAAGAAAAAT ATAACGTAAA CGCGCTTTAT TTGTTGGCGC ACGCGATTCA TGAAAGCGAT TGGGGAACTA GCGAAATCGC GAAAGAGAAG AAAAACTTAT TCGGTATTCG AGCGGTTGAT AGCGATCCAC TCAACAGTGC GGTGAAGTTT AACACCTTCG AAGATTGCAT CAATTATATG GCACAGACGA TAGTATCGAA CAGATATGCC AATCCAAAGG ACTGGCGGTA TAGTGGAGCG GTTCTTGGTG ATAAGACCAT TGGCTTTAAC GTCCGTTATG CGTCTGACCC ATATTGGGGA CAAAAAATCG CTGGAATTAT GTATCGGGCT GACAAATTTT TAGGCTGGAA AGACTGGGGC AAATATACGA TTATGGGAAC AACAGTAGAC GGCGTGAACG TCCGATTGCA GCCAGCGGCG GTTCCGCCTG TATTCTATGA ATACCGTCTT GCAAACACTC CTGTCGTTAA AATCGGGGAA ACAGCGAAAC AGCCGGACGG CTATGTATGG 
TATCAAATTC TTTCCGATTT ACCGACGGAA GAGAATGCTT ATATCCGCAG CGACTTATTA AAACCGCTGT CAATCGCAAA ATAA
|
Protein sequence | MRLLVSLFLV FCLFFANISV SFADDITGIA LEKEMRAMVN QGVVQGYPDG KYRPNESVTR GQFATFVARA LQLPKGSGNF SDVSPSSELA DGIYKASAAG IVQGYSNGTF GVNNKITREE MAIMIDRTLD YLGIEKKQAS LDHFTDVDGL YSASKIAISH NVYYGIIRGI PNTDGTTFRF APKAYATRAH AAAFLYRLLE VWAEQTPEMA YQIAEIQNGQ LTLLPKRYAT FAQAEASVTN WSSQVVVQGT KIVKMASGNV IAQPSPGKST TIIYQSNLST SLTYVAPNTE MKYLDADEEK VKVQIANTIG YVKQSEVMLV PTALLQGRSY YIAQKGELYH YIYNTTSNKY VSYLYGKAPS FMQDGQKYYS WDGETFYNTA GKPIGTAYQY FNVLPIRTKT NYTAEELNRF VAANRSDSPL KTLGEAFKKT EEKYNVNALY LLAHAIHESD WGTSEIAKEK KNLFGIRAVD SDPLNSAVKF NTFEDCINYM AQTIVSNRYA NPKDWRYSGA VLGDKTIGFN VRYASDPYWG QKIAGIMYRA DKFLGWKDWG KYTIMGTTVD GVNVRLQPAA VPPVFYEYRL ANTPVVKIGE TAKQPDGYVW YQILSDLPTE ENAYIRSDLL KPLSIAK
|
| |
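The listed gene sequence begins with TTG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. Under translation table 11, an initiator codon such as TTG is read as methionine, which is why the protein starts with M. The following is a minimal sketch of that translation step and of a GC-content calculation; it is not the database's own pipeline, and the helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); table 11 uses the
# same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiator codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as Met
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence above.
head = "TTGAGGTTAC TTGTCAGTCT TTTTCTAGTT"
print(translate(head))  # MRLLVSLFLV
```

Passing the full gene sequence (both lines joined) to `translate` and `gc_percent` gives values to compare with the Protein sequence and GC content fields.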