Gene GWCH70_1053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1053 
Symbol 
ID7976833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1104502 
End bp1106214 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content48% 
IMG OID644798006 
ProductFibronectin-binding A domain protein 
Protein accessionYP_002949179 
Protein GI239826555 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000411066 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATTTG ACGGAGTATT TACATACGCA ATGACAAAAG AGTTGCAACA AGCGCTTGAA 
GGAGGGCGCA TCACGAAAAT TCATCAGCCG TTTGCCCACG AACTCGTATT GCAAATCCGC
TCGTACGGGC GAAATTATAA ATTGCTGCTG TCTGCGCATC CGAGCTATGC GCGCGTTCAT
TTAACGAATG AAACGTATGA CAATCCGGCA GAACCGCCGA TGTTTTGTAT GCTTTTGCGC
AAACATTTAG AAGGAAGCAT CATCGAAGCG ATCCGCCAAG TCGATTTCGA CCGCATCATC
ACTATTGAGA CAAAAGGGAG AAACGAGATT GGCGATATCC ATACTAGACA GCTCATCATC
GAAATTATGG GACGGCATAG CAACATTATT TTGATCGACA AAGATACAAA CACGATTATC
GACAGCATTA AACACCTCTC CCCTGCCGTC AACCGGTATC GTACGGTGCT TCCTGGCCAT
GAGTACATCG CACCGCCATC GCACGGGAAA ATAAATCCGC TTGAAGCAAC CGAAGAAACA
GTCTTGAAAA AAATTGATTT TCATGCGGGG AAATTAGCGG AGCAGCTCGT TGCTGCATTT
TCAGGCATTT CGCCGCTCTT AGCAAAAGAA ATCGTTTTTC GCGCCGGGCT GGCGAATCGG
GCAACACTGC CGAAAAGCTT TATCGCAGTG ATGGATGAGG TGCGCTCCCA TCGCTTTGCG
CCCGCAATGT ACACAAACGG GGAAAAAGAA TGGTTTTACG TGCTTCCGCT TGCCCACCTG
CAGGCAGAAG CAAAGCCGTT TGACACGCTC AGCAAGCTTC TTGACCGCTT TTACTTTGGC
AAAGCCGAGC GCGACCGCGT CAAACAGCAA GCTCACGACC TCGAGCGGTT TATCGCAAAC
GAAAAAGCGA AAAACGAAAA AAAGCTGATT AAGCTGAAGC AAACATTAGA GGAGGCAAAA
CAAGCGGAAC AATATCGGCT TTACGGGGAG CTGTTGACCG CTAACCTGTA CGCCATCAAA
CGGGGAATGA AAGAAATCGA AGTGATCAAC TATTACGATG AAAATGGCGC GACGGTGACG
ATTCCGCTCG ATCCGCAAAA ATCGCCGTCA GAAAACGCGC AAAGCTATTT TCAAAAATAC
CAAAAGGCGA AAAACTCGCT AAACATCGTC CAAGAACAAA TCAAGCGCAC AAACGAAGAA
ATCGATTATT TGGACACGCT TCTTCAGCAG CTGGAAACCG CCGCTCCGAA AGATGTGGAA
GAAATACGCG AAGAATTAAT CGAACAAGGG TATTTGCGGG CGCGCGCCAC CAAACAAACG
AAAAAGCAGA AACAGCGGAA AATCGAGCTG GACCGCTACG TCGCGAGCGA CGGCACGGAA
ATTCTGGTTG GGAAAAACAA CAAACAAAAC GATTATTTAA CGACGAAACT AGCGCATAAA
GACGAGATTT GGCTGCACAC GAAAGACATT CCCGGCTCAC ATGTCGTCAT TCGCAGCAAA
AATCCGTCCG AGCAAACAAT CGCCGAAGCC GCCAACCTTG CCGCCTACTT CAGCAAAGCG
CGCCAATCAA GCTCTGTTCC CGTCGACTAC ACGCGCATCC GCTACGTCAA AAAACCGAGC
GGCGCCAAAC CAGGCTTTGT TATTTACGAA AACCAACAAA CGATTTACGT TACGCCGGAT
GAGGATTTGG TGATTCGGAT GAAAAAACAA TAA
 
Protein sequence
MAFDGVFTYA MTKELQQALE GGRITKIHQP FAHELVLQIR SYGRNYKLLL SAHPSYARVH 
LTNETYDNPA EPPMFCMLLR KHLEGSIIEA IRQVDFDRII TIETKGRNEI GDIHTRQLII
EIMGRHSNII LIDKDTNTII DSIKHLSPAV NRYRTVLPGH EYIAPPSHGK INPLEATEET
VLKKIDFHAG KLAEQLVAAF SGISPLLAKE IVFRAGLANR ATLPKSFIAV MDEVRSHRFA
PAMYTNGEKE WFYVLPLAHL QAEAKPFDTL SKLLDRFYFG KAERDRVKQQ AHDLERFIAN
EKAKNEKKLI KLKQTLEEAK QAEQYRLYGE LLTANLYAIK RGMKEIEVIN YYDENGATVT
IPLDPQKSPS ENAQSYFQKY QKAKNSLNIV QEQIKRTNEE IDYLDTLLQQ LETAAPKDVE
EIREELIEQG YLRARATKQT KKQKQRKIEL DRYVASDGTE ILVGKNNKQN DYLTTKLAHK
DEIWLHTKDI PGSHVVIRSK NPSEQTIAEA ANLAAYFSKA RQSSSVPVDY TRIRYVKKPS
GAKPGFVIYE NQQTIYVTPD EDLVIRMKKQ