Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_3053 |
Symbol | |
ID | 7977416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 3072335 |
End bp | 3073594 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644799847 |
Product | hypothetical protein |
Protein accession | YP_002950986 |
Protein GI | 239828362 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATTTC CAATTAAAAA AAGAAACATC ATCGTCGCCA GTATGCTGTT AACCGCACTG GTGATTCGGC TTTTTGTTCT ATGGAAGTAT GGATTAGACC TTACATTAAA TAGCGATGAT ATGGGATATG TAAGAAGCGG CAAAAGACTC TTGGAAAATG GGATGCTCAC ATACCACCAT GAAAGCGAGC CAACTGTCCA TATTATGCCA GGGATGCCGA TCTTATTGGC TGCTATTTTT TTCTTTTTCG GCACTGGCGA TATAGGTCTT TACGCAGCAA AAGTGGTGAT GATTTTATTT GGCGTAGCCA GTGTCTATCT TATCTATGTA ATCGGAAAAG ATATGAACCA AGAATGGGCT GGCATCATCG CTGGATTTTT CACTGCACTA TTTGTTCCAC TTATTGAAAC GGACAATTTA ACTTTGACCG AACCGCCTTT TCTATTCGGA TTTCTTCTTT TCATTCATTT CGCTATCCAA CTTGGACGAA ATCATAAAAT GTCTACTTTC TATTGGCTGA TGTTTTCCTA CTTGTTCTGC TTGTTGTTTC GAGCTACCTT TGCTCTGATT CCATTTGCGC TTCTCGGCTA CTTTCTGCTC ATCAAATATC CGCTTCGGCT CGCATGGAAG CAATTAGGAG TAGCTGTGTT ATTAGTGATC ATCGTACTTG GCCCATGGTG GGTGCGCAAC TACATTCACT ACAAAGAATT CATTCCATTA ACCGGCGGTT CTGGAGACCC GCTTCTGTTA GGCACTTATC AAGGATATGG ATACCGATAT GGCGAACCTT ATAAAGAAGT AATCAAAAAA ATCGATGAAC AATACCCGCA TATAAGCAAT TACGAAAAAC AAAAACTGGA AAAACAAATA GCGATAGAGC GAATAAAAAA ATGGTATCAC GCAAATCCGA AACAATTTAT CGAAAGCTAT ACAACTAAAA AAGCAAAAAT ACAATGGGAA CAGCCTTTTT ATTGGATTGA GATCTTAGGA GTTGCGAAAA ATACCATGAT TTCCGTCCAC CAATGGGTTG TATCACTGGC GTTTGCGTCC ATGGCACTCA CCTTGCTTTT GTTAAAGCGA AATCGAAAAG AATTGTTGTT TTTTACTTTC ATCATCGCGT ACTTTACCAT TTTAAACAAC GTGTTTTTCT CCTACCCGAG ATATAACCTG CCGTTAATGC CGCTTTTGTT TTTATATATC GGCCTGCTCG TTTCCGCTGT TCCGTTCATA CTTTTCCGCA AAAAAACAAC CTCCTCTTAA
|
Protein sequence | MRFPIKKRNI IVASMLLTAL VIRLFVLWKY GLDLTLNSDD MGYVRSGKRL LENGMLTYHH ESEPTVHIMP GMPILLAAIF FFFGTGDIGL YAAKVVMILF GVASVYLIYV IGKDMNQEWA GIIAGFFTAL FVPLIETDNL TLTEPPFLFG FLLFIHFAIQ LGRNHKMSTF YWLMFSYLFC LLFRATFALI PFALLGYFLL IKYPLRLAWK QLGVAVLLVI IVLGPWWVRN YIHYKEFIPL TGGSGDPLLL GTYQGYGYRY GEPYKEVIKK IDEQYPHISN YEKQKLEKQI AIERIKKWYH ANPKQFIESY TTKKAKIQWE QPFYWIEILG VAKNTMISVH QWVVSLAFAS MALTLLLLKR NRKELLFFTF IIAYFTILNN VFFSYPRYNL PLMPLLFLYI GLLVSAVPFI LFRKKTTSS
|
| |