Gene BAS3881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS3881 
Symbol 
ID2848092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp3829314 
End bp3830573 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content41% 
IMG OID637507118 
Productbranched-chain alpha-keto acid dehydrogenase subunit E2 
Protein accessionYP_030131 
Protein GI49186879 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCATTTG AATTTAAACT ACCAGATATC GGTGAAGGTA TCCACGAAGG TGAAATCGTA 
AAATGGTTTA TTAAACCAGG CGACGAAGTA AACGAAGACG ACGTACTTCT TGAAGTACAA
AATGATAAAG CAGTAGTAGA AATTCCTTCT CCTGTTAAAG GTAAAGTACT TGAAGTACTT
GTAGAAGAAG GTACGGTTGC AGTAGTTGGA GATACATTAA TTAAATTTGA TGCTCCAGGA
TACGAAAACC TTAAATTTAA AGGCGACGAT CATGACGAAG CTCCTAAAGC TGAAGCTACT
CCAGCAGCAA CTGCAGAAGT AGTAAATGAG CGCGTAATCG CTATGCCATC TGTTCGTAAA
TATGCTCGTG AAAACGGCGT AGACATTCAT AAAGTAGCTG GTTCTGGTAA GAACGGTCGT
ATCGTAAAAG CTGACATCGA TGCATTTGCA AATGGTGGAC AAGCAGTAGC AGCAACTGAG
GCTCCAGCAG CAGTAGAAGC TACTCCAGCA GCAGCGAAAG AAGAAGCACC AAAAGCACAA
CCAATCCCAG CTGGTGAATA TCCAGAAACT CGTGAGAAAA TGAGTGGTAT CCGTAAAGCA
ATTGCGAAAG CAATGGTTAA CTCTAAACAT ACAGCTCCTC ACGTAACATT AATGGATGAA
GTAGATGTAA CTGAACTTGT TGCTCACCGT AAGAAGTTCA AAGCTGTGGC AGCTGACAAA
GGTATTAAAT TAACTTACCT TCCATACGTT GTTAAAGCTT TAACATCTGC ATTACGTGAA
TACCCAATGT TAAACACTTC TTTAGATGAT GCTTCTCAAG AAGTAGTTCA TAAACATTAC
TTCAACATCG GTATCGCAGC TGATACAGAC AAAGGTCTAT TAGTACCAGT TGTTAAAGAT
ACAGATCGCA AGTCTATCTT CACAATTTCT AACGAGATCA ATGATCTTGC TGGTAAAGCA
CGTGAAGGTC GTTTAGCTCC TGCTGAAATG AAAGGCGCTT CTTGCACAAT TACAAACATT
GGTTCTGCAG GTGGACAATG GTTCACTCCA GTTATCAACC ACCCAGAAGT AGCAATCCTT
GGTATCGGCC GTATCGCTGA GAAACCAGTT GTGAAAAACG GTGAGATCGT TGCAGCTCCA
GTATTAGCAT TATCTCTAAG CTTTGACCAT CGTTTAATTG ACGGCGCAAC TGCTCAAAAA
GCATTAAACC AAATTAAACG TCTATTGAAT GACCCACAAT TATTAGTAAT GGAGGCGTAA
 
Protein sequence
MAFEFKLPDI GEGIHEGEIV KWFIKPGDEV NEDDVLLEVQ NDKAVVEIPS PVKGKVLEVL 
VEEGTVAVVG DTLIKFDAPG YENLKFKGDD HDEAPKAEAT PAATAEVVNE RVIAMPSVRK
YARENGVDIH KVAGSGKNGR IVKADIDAFA NGGQAVAATE APAAVEATPA AAKEEAPKAQ
PIPAGEYPET REKMSGIRKA IAKAMVNSKH TAPHVTLMDE VDVTELVAHR KKFKAVAADK
GIKLTYLPYV VKALTSALRE YPMLNTSLDD ASQEVVHKHY FNIGIAADTD KGLLVPVVKD
TDRKSIFTIS NEINDLAGKA REGRLAPAEM KGASCTITNI GSAGGQWFTP VINHPEVAIL
GIGRIAEKPV VKNGEIVAAP VLALSLSFDH RLIDGATAQK ALNQIKRLLN DPQLLVMEA