Gene GWCH70_3053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_3053 
Symbol 
ID7977416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp3072335 
End bp3073594 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content40% 
IMG OID644799847 
Producthypothetical protein 
Protein accessionYP_002950986 
Protein GI239828362 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATTTC CAATTAAAAA AAGAAACATC ATCGTCGCCA GTATGCTGTT AACCGCACTG 
GTGATTCGGC TTTTTGTTCT ATGGAAGTAT GGATTAGACC TTACATTAAA TAGCGATGAT
ATGGGATATG TAAGAAGCGG CAAAAGACTC TTGGAAAATG GGATGCTCAC ATACCACCAT
GAAAGCGAGC CAACTGTCCA TATTATGCCA GGGATGCCGA TCTTATTGGC TGCTATTTTT
TTCTTTTTCG GCACTGGCGA TATAGGTCTT TACGCAGCAA AAGTGGTGAT GATTTTATTT
GGCGTAGCCA GTGTCTATCT TATCTATGTA ATCGGAAAAG ATATGAACCA AGAATGGGCT
GGCATCATCG CTGGATTTTT CACTGCACTA TTTGTTCCAC TTATTGAAAC GGACAATTTA
ACTTTGACCG AACCGCCTTT TCTATTCGGA TTTCTTCTTT TCATTCATTT CGCTATCCAA
CTTGGACGAA ATCATAAAAT GTCTACTTTC TATTGGCTGA TGTTTTCCTA CTTGTTCTGC
TTGTTGTTTC GAGCTACCTT TGCTCTGATT CCATTTGCGC TTCTCGGCTA CTTTCTGCTC
ATCAAATATC CGCTTCGGCT CGCATGGAAG CAATTAGGAG TAGCTGTGTT ATTAGTGATC
ATCGTACTTG GCCCATGGTG GGTGCGCAAC TACATTCACT ACAAAGAATT CATTCCATTA
ACCGGCGGTT CTGGAGACCC GCTTCTGTTA GGCACTTATC AAGGATATGG ATACCGATAT
GGCGAACCTT ATAAAGAAGT AATCAAAAAA ATCGATGAAC AATACCCGCA TATAAGCAAT
TACGAAAAAC AAAAACTGGA AAAACAAATA GCGATAGAGC GAATAAAAAA ATGGTATCAC
GCAAATCCGA AACAATTTAT CGAAAGCTAT ACAACTAAAA AAGCAAAAAT ACAATGGGAA
CAGCCTTTTT ATTGGATTGA GATCTTAGGA GTTGCGAAAA ATACCATGAT TTCCGTCCAC
CAATGGGTTG TATCACTGGC GTTTGCGTCC ATGGCACTCA CCTTGCTTTT GTTAAAGCGA
AATCGAAAAG AATTGTTGTT TTTTACTTTC ATCATCGCGT ACTTTACCAT TTTAAACAAC
GTGTTTTTCT CCTACCCGAG ATATAACCTG CCGTTAATGC CGCTTTTGTT TTTATATATC
GGCCTGCTCG TTTCCGCTGT TCCGTTCATA CTTTTCCGCA AAAAAACAAC CTCCTCTTAA
 
Protein sequence
MRFPIKKRNI IVASMLLTAL VIRLFVLWKY GLDLTLNSDD MGYVRSGKRL LENGMLTYHH 
ESEPTVHIMP GMPILLAAIF FFFGTGDIGL YAAKVVMILF GVASVYLIYV IGKDMNQEWA
GIIAGFFTAL FVPLIETDNL TLTEPPFLFG FLLFIHFAIQ LGRNHKMSTF YWLMFSYLFC
LLFRATFALI PFALLGYFLL IKYPLRLAWK QLGVAVLLVI IVLGPWWVRN YIHYKEFIPL
TGGSGDPLLL GTYQGYGYRY GEPYKEVIKK IDEQYPHISN YEKQKLEKQI AIERIKKWYH
ANPKQFIESY TTKKAKIQWE QPFYWIEILG VAKNTMISVH QWVVSLAFAS MALTLLLLKR
NRKELLFFTF IIAYFTILNN VFFSYPRYNL PLMPLLFLYI GLLVSAVPFI LFRKKTTSS