Gene BAS1034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS1034 
Symbol 
ID2849294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp1089061 
End bp1090179 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content37% 
IMG OID637504292 
Producthypothetical protein 
Protein accessionYP_027306 
Protein GI49184054 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0320864 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTTGTA CAGCATTTAA AAAACTATGG GAGAGATACG AAAACGGAAC GCTCACGCAT 
AATGAACAAG AACTGTTAGA AAATCATATT GAGACATGTG AAGAATGCGA GGCTTACTTA
GATCAATTGC TTTCGAAGAG TGAACCAATA AAGAAAAAAC TACCACCACA AAAACTGAAA
GTCCCATTTT GGAAAATAAA ATGGAAACAA CGTTGGCAAA CCGTTAGTTT TGTCCTTGCC
GTTTGTATTG CAATCTATTT TGTTGGTCAT TTTTCATCTT CTCTTTACTT CTATAATATG
AAAAAGTTAG TCGAAGTAGA TGAGATTCCA GCACTCGCAC TAGAAGCAAC AATTCCAAAT
AGTCGTTCCG CTGGAGGCAG TACAAAGATT AAACCCTTTT TCCGTACAGA AAATGAAATG
AATTTAGTTA AAACGGTCGG TAAAAAAGAA ATGCCAATTG GTACAGTAAC AACGCGTAGT
TTCTTATCAT CTGTAACTGA CACAAATCAA TCATGGGCAA ATAAACCCTA TTCCAAAAAA
CTTTCCTTTG TTCACCCGAA AATCAAGCAA GATGATCATT TGAAAGAAAT CTCTAAAAAA
GTTTGGAGTA CACTCGGAAA GATACATGAG GGCACCGTTG CAGAAGTAGC AATATCTTTT
GACAAACCTT ACACTTTACA AGAGTTAGAA TCCATTCTAT ATAGCGCATT TGAAGCACAA
GAAATGCCGC CAACTCCTTT ATGGTACGCT TTAGACACAG GGCAAGAAAG AATAGATGAA
GAAGATTTCA TTCTACATGA CGGAGAGGTT ATCGGATTTT CAGAACATAT AAATCTCCCT
GATAATGAAG CAAAACGACC GAAGACAAAA GAAGATGAAG TAATCGAAAT GATGCGCATT
CTTTCTACAC ATAAAGAAAC TGTAAGTAAA ACTACTCGGA CTTCTGAAAA AGAGCTGAAC
TTAGATAAAC GTTATGAGTA TGTAAAAGAT AACGGTGTGA AAGTATACGG GATCGTCATT
ACCGGACCGT CGAAAGAGTT ATTAAAATTA CAAAACTCGC CTCACGTACG TTATGCGACT
CTTGGAGATA TTGAGGTTTG GAATTGGTTT AATCAGTGA
 
Protein sequence
MGCTAFKKLW ERYENGTLTH NEQELLENHI ETCEECEAYL DQLLSKSEPI KKKLPPQKLK 
VPFWKIKWKQ RWQTVSFVLA VCIAIYFVGH FSSSLYFYNM KKLVEVDEIP ALALEATIPN
SRSAGGSTKI KPFFRTENEM NLVKTVGKKE MPIGTVTTRS FLSSVTDTNQ SWANKPYSKK
LSFVHPKIKQ DDHLKEISKK VWSTLGKIHE GTVAEVAISF DKPYTLQELE SILYSAFEAQ
EMPPTPLWYA LDTGQERIDE EDFILHDGEV IGFSEHINLP DNEAKRPKTK EDEVIEMMRI
LSTHKETVSK TTRTSEKELN LDKRYEYVKD NGVKVYGIVI TGPSKELLKL QNSPHVRYAT
LGDIEVWNWF NQ