Gene BAS4541 details

Gene Information

Locus tag: BAS4541
Symbol:
ID: 2850104
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 4446666
End bp: 4448234
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 39%
IMG OID: 637507778
Product: hypothetical protein
Protein accession: YP_030788
Protein GI: 49187535
COG category: [R] General function prediction only
COG ID: [COG1574] Predicted metal-dependent hydrolase with the TIM-barrel fold
TIGRFAM ID:
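
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
start_bp, end_bp = 4446666, 4448234

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1569
print(protein_length_aa(length))  # 522
```

Both results agree with the Gene Length and Protein Length fields of the record.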


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.136084
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGGAAA TTTGGTATGG CGGCAACATT TACACGATGA GGGAAGAAAA TGAAAAAGTA 
GAAGCTATTT ATGTTGAAAA TGGCAGGATC GTTGATAATG GAAGGAAAGA AGAGTTAGAA
AACCGATATG CTGTGGCTAA ATTGCACGAT TTAAAAGGCA AAACGATGAT TCCAGGCCTC
GTTGATAGCC ATATGCATCT TATTGGTCAC GGGGAGAGAT TACTTCGTTT AGATTTATCA
AATTGCACAT CTTATAGCGA AGTGCTGACT CTCGTTCGGA GGCGAGTAGA AGAAGCGCCG
AAAGGTTCTT GGATTATCGG AGAGGGCTGG AATGAAAATA ACTTTAAGGA TACGAAAGAT
GTTCACGCAA AAGATTTAGA TGCAATTTCA AGAGAACATC CCATTTTATT AAAGCGCGTT
TGTCGCCATG TTACATGGGT GAACTCATAC ATACTGCAAG AAGCGAACAT AACAGAAAAG
GCAAAAGATC CAAAAGGCGG GAAAGTTGGA CGGGACTCAT TCAATAAATT AACAGGACTT
TTATATGAAC AAGGCCAAGA GTTAATCAAA CATGTCCAGC CTGAAATTGA TGAATCCTAC
TTACAAAGAG CTTTGCAAAC AGCAATTAAA GACTGCTGGC AATATGGACT CGTTGGCGGG
CATACGGAAG ATTTAAATTA TTACGGTGGA TTTAGAAAAA CGCATAATGC GTTTTCTCAT
GTTATAAAAG AAATGCCATT TAAAGCACAT TTACTCGTTC ACCATGAAGT AGCACATGAA
CGAAAAGAAT ATGAAAATGA GCATTATATT GAGTTTGGGG CAATGAAAAT TTTTTCTGAC
GGTTCTTTTG GCGGAAGAAC AGCTTTATTA AGTGAACCGT ATGAAGATGC GAAGGAAACG
AATGGGGTTG CGATTTTCTC ACGTGAAGAA CTTGCGGAGT TAGTGAAAAA AGCACGAGAC
TTACATATGC CAGTTGCGAT TCATACTATC GGTGACTTAT CGCTTGAATA TGTCATTGAT
GTACTTGAAT TGTATCCGCC AGCAGAAGGA TTACGTGACC GCATTATTCA TTGTCAGCTA
GCTCGTGAAG AGTTGATTGA AAGAATGAAA AACTTACAAG CCATTATTGA TATACAACCA
GTCTTTGTTT CATCGGATTT TCCATCAGTC ATTGAAAAAC TGGGCGAGCA ACGTCTTCGT
TATGCCTACG CTTGGAAGAC GTTACTGGAG GCAGGATTAC ACTGTAACGG GGGATCAGAT
GCTCCGATTG AGCAAGTGAA TCCGTTTCTA GGCATATATA GCGCTGTTAC ACGTAGAAGT
TTTATTGACG GTTTATGTTA TATGCCAGAA GAAAGATTAA CGGTATATGA GGCTGTTTCT
TTATTTACAA CAGGAAGTGC CTATGCAATT GGAAAAGAAG CGAAGCGAGG GCAAATTACA
AAAGGATATG AGGCAGACTT TACAATATTA GACCGCGATA TTTTTGAAAT AGAGGCAGAA
GAAATAAAAG AAGTACAGGC AGAAATGACC GTAATAGATG GCCAAGTCGT CTATAGAAAA
GATTCATAA
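
The GC content field reported for this gene is the percentage of G and C bases in the nucleotide sequence. A minimal sketch of that calculation follows. It assumes the sequence is pasted in the layout shown above (blocks of 10 bases separated by spaces and line breaks), so whitespace is stripped first. The function name gc_percent is hypothetical, and only the first row of the sequence is used here for brevity.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above; paste all rows to check the whole gene.
first_row = "ATGGGGGAAA TTTGGTATGG CGGCAACATT TACACGATGA GGGAAGAAAA TGAAAAAGTA"
print(f"{gc_percent(first_row):.1f}%")
```

Running the function on the complete gene sequence, rather than one row, is what would be compared against the GC content value in the Gene Information block.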
 
Protein sequence
MGEIWYGGNI YTMREENEKV EAIYVENGRI VDNGRKEELE NRYAVAKLHD LKGKTMIPGL 
VDSHMHLIGH GERLLRLDLS NCTSYSEVLT LVRRRVEEAP KGSWIIGEGW NENNFKDTKD
VHAKDLDAIS REHPILLKRV CRHVTWVNSY ILQEANITEK AKDPKGGKVG RDSFNKLTGL
LYEQGQELIK HVQPEIDESY LQRALQTAIK DCWQYGLVGG HTEDLNYYGG FRKTHNAFSH
VIKEMPFKAH LLVHHEVAHE RKEYENEHYI EFGAMKIFSD GSFGGRTALL SEPYEDAKET
NGVAIFSREE LAELVKKARD LHMPVAIHTI GDLSLEYVID VLELYPPAEG LRDRIIHCQL
AREELIERMK NLQAIIDIQP VFVSSDFPSV IEKLGEQRLR YAYAWKTLLE AGLHCNGGSD
APIEQVNPFL GIYSAVTRRS FIDGLCYMPE ERLTVYEAVS LFTTGSAYAI GKEAKRGQIT
KGYEADFTIL DRDIFEIEAE EIKEVQAEMT VIDGQVVYRK DS
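
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this and is not the tool used to produce the record. It builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code for internal codons. It does not handle the alternative start codons that table 11 allows, which is sufficient here because this gene starts with ATG. The function name translate is hypothetical.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "ATGGGGGAAA TTTGGTATGG CGGCAACATT TACACGATGA GGGAAGAAAA TGAAAAAGTA"
print(translate(first_row))  # MGEIWYGGNIYTMREENEKV
```

The output matches the first 20 residues of the protein sequence above. Likewise, the last row of the gene sequence, GATTCATAA, translates to DS followed by a stop, which matches the end of the protein sequence.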