Gene Information |
Locus tag | GWCH70_1053 |
Symbol | |
ID | 7976833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 1104502 |
End bp | 1106214 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644798006 |
Product | Fibronectin-binding A domain protein |
Protein accession | YP_002949179 |
Protein GI | 239826555 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
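The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(1104502, 1106214)
print(length, protein_length_aa(length))  # prints: 1713 570
```

The strand field does not affect this calculation, since the length depends only on the span between Start bp and End bp.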
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000411066 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATTTG ACGGAGTATT TACATACGCA ATGACAAAAG AGTTGCAACA AGCGCTTGAA GGAGGGCGCA TCACGAAAAT TCATCAGCCG TTTGCCCACG AACTCGTATT GCAAATCCGC TCGTACGGGC GAAATTATAA ATTGCTGCTG TCTGCGCATC CGAGCTATGC GCGCGTTCAT TTAACGAATG AAACGTATGA CAATCCGGCA GAACCGCCGA TGTTTTGTAT GCTTTTGCGC AAACATTTAG AAGGAAGCAT CATCGAAGCG ATCCGCCAAG TCGATTTCGA CCGCATCATC ACTATTGAGA CAAAAGGGAG AAACGAGATT GGCGATATCC ATACTAGACA GCTCATCATC GAAATTATGG GACGGCATAG CAACATTATT TTGATCGACA AAGATACAAA CACGATTATC GACAGCATTA AACACCTCTC CCCTGCCGTC AACCGGTATC GTACGGTGCT TCCTGGCCAT GAGTACATCG CACCGCCATC GCACGGGAAA ATAAATCCGC TTGAAGCAAC CGAAGAAACA GTCTTGAAAA AAATTGATTT TCATGCGGGG AAATTAGCGG AGCAGCTCGT TGCTGCATTT TCAGGCATTT CGCCGCTCTT AGCAAAAGAA ATCGTTTTTC GCGCCGGGCT GGCGAATCGG GCAACACTGC CGAAAAGCTT TATCGCAGTG ATGGATGAGG TGCGCTCCCA TCGCTTTGCG CCCGCAATGT ACACAAACGG GGAAAAAGAA TGGTTTTACG TGCTTCCGCT TGCCCACCTG CAGGCAGAAG CAAAGCCGTT TGACACGCTC AGCAAGCTTC TTGACCGCTT TTACTTTGGC AAAGCCGAGC GCGACCGCGT CAAACAGCAA GCTCACGACC TCGAGCGGTT TATCGCAAAC GAAAAAGCGA AAAACGAAAA AAAGCTGATT AAGCTGAAGC AAACATTAGA GGAGGCAAAA CAAGCGGAAC AATATCGGCT TTACGGGGAG CTGTTGACCG CTAACCTGTA CGCCATCAAA CGGGGAATGA AAGAAATCGA AGTGATCAAC TATTACGATG AAAATGGCGC GACGGTGACG ATTCCGCTCG ATCCGCAAAA ATCGCCGTCA GAAAACGCGC AAAGCTATTT TCAAAAATAC CAAAAGGCGA AAAACTCGCT AAACATCGTC CAAGAACAAA TCAAGCGCAC AAACGAAGAA ATCGATTATT TGGACACGCT TCTTCAGCAG CTGGAAACCG CCGCTCCGAA AGATGTGGAA GAAATACGCG AAGAATTAAT CGAACAAGGG TATTTGCGGG CGCGCGCCAC CAAACAAACG AAAAAGCAGA AACAGCGGAA AATCGAGCTG GACCGCTACG TCGCGAGCGA CGGCACGGAA ATTCTGGTTG GGAAAAACAA CAAACAAAAC GATTATTTAA CGACGAAACT AGCGCATAAA GACGAGATTT GGCTGCACAC GAAAGACATT CCCGGCTCAC ATGTCGTCAT TCGCAGCAAA AATCCGTCCG AGCAAACAAT CGCCGAAGCC GCCAACCTTG CCGCCTACTT CAGCAAAGCG CGCCAATCAA GCTCTGTTCC CGTCGACTAC ACGCGCATCC GCTACGTCAA AAAACCGAGC GGCGCCAAAC CAGGCTTTGT TATTTACGAA AACCAACAAA CGATTTACGT TACGCCGGAT GAGGATTTGG TGATTCGGAT GAAAAAACAA TAA
|
Protein sequence | MAFDGVFTYA MTKELQQALE GGRITKIHQP FAHELVLQIR SYGRNYKLLL SAHPSYARVH LTNETYDNPA EPPMFCMLLR KHLEGSIIEA IRQVDFDRII TIETKGRNEI GDIHTRQLII EIMGRHSNII LIDKDTNTII DSIKHLSPAV NRYRTVLPGH EYIAPPSHGK INPLEATEET VLKKIDFHAG KLAEQLVAAF SGISPLLAKE IVFRAGLANR ATLPKSFIAV MDEVRSHRFA PAMYTNGEKE WFYVLPLAHL QAEAKPFDTL SKLLDRFYFG KAERDRVKQQ AHDLERFIAN EKAKNEKKLI KLKQTLEEAK QAEQYRLYGE LLTANLYAIK RGMKEIEVIN YYDENGATVT IPLDPQKSPS ENAQSYFQKY QKAKNSLNIV QEQIKRTNEE IDYLDTLLQQ LETAAPKDVE EIREELIEQG YLRARATKQT KKQKQRKIEL DRYVASDGTE ILVGKNNKQN DYLTTKLAHK DEIWLHTKDI PGSHVVIRSK NPSEQTIAEA ANLAAYFSKA RQSSSVPVDY TRIRYVKKPS GAKPGFVIYE NQQTIYVTPD EDLVIRMKKQ
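The relationship between the Gene sequence and Protein sequence fields can be illustrated with a minimal translation sketch. It assumes the gene sequence is read as listed (it begins with ATG and ends with TAA) and uses the codon assignments of translation table 11, which match the standard code. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two 10-base groups of the gene sequence.
print(translate("ATGGCATTTG ACGGAGTATT"))  # prints: MAFDGV
```

Passing the full Gene sequence value to `translate` should give a string that can be compared against the Protein sequence field once its spaces are removed.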