Gene BAS4747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4747 
Symbol 
ID2851456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4629460 
End bp4630908 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content41% 
IMG OID637507981 
ProductO-succinylbenzoic acid--CoA ligase 
Protein accessionYP_030991 
Protein GI49187738 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID[TIGR01923] O-succinylbenzoate-CoA ligase 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGAGA CGATGCCAAA TTGGTTAAAG CAACGTGCAT TTTTAACACC AGATCGCACT 
GCAATTGAAA TAGAGGAAGA GAAAGTTACT TTTATGCAGC TGCATGAAAA AGTAGTATCT
GTTTGTGAAC ACCTCACGCA TGTAGGAGTG AATCGTGGGC AAAAGGTGGC TGTTCTGATG
AAAAATGGTA TGGAGATGAT TACAGTTATT CACGCCCTAT CTTACGTAGG TGCAGTAGCT
GTGCTTTTAA ATACGCGTCT TTCAAGAGAA GAGCTACTTT GGCAAATGGA TGATGCTGAA
GTGATTTGTT TAGTGACAGA TCAAGATTTT GAGGCTAAAG ATATTCCTGT CTATTCATTC
GCCGAAGTGA TGAATGGACC AAAAGAGGAA GCCTCTATAC AAGAAGAATT CTCTTTAAGA
GAAGCGATGA CAATTATTTA TACGTCAGGT ACGACTGGAA AACCGAAAGG CGTTATTTTA
ACGTACGGGA ATCACTGGGC AAGCGCGGTT GGTTCTTCGC TTAATTTAGG ACTTCGTGAT
GATGATTGCT GGTTAGCTTG TATGCCGATG TTCCACGTTG GCGGGCTATC TCTTTTAATG
AAAAATATTA TGTACGGCAT GCGCATTTTA CTCGTTCCGA AATATGATGC TGATTTTATT
CATAAAGCAC TTCAAACGAG AGGCGTTACG ATTATTTCTG TCGTTTCTAA AATGTTAACT
GATTTATTAG AGCGACTTGG AGAAGGAACA TATCCATCTT CTTTCCGATG TATGTTACTT
GGCGGAGGAC CAGCGCCGAA ACCGTTATTA GAAACGTGTG TAGATAAAGG GATTCCTGTA
TATCAAACGT ACGGTATGAC AGAAACGTCT TCGCAAATTT GTACGTTATC CGCGGATTAC
ATGTTAACGA AAGTAGGATC AGCCGGCAAA CCACTATTTC AATGCCAACT TCGTATTGAA
AAAGACGGCG TAGTAGTGCC GCCGTTTGCA GAAGGCGAGA TTGTCGTAAA AGGACCAAAC
GTAACAGGCG GTTACTTTAA CCGTGAAGAT GCAACGCGCG AGACTATTCA AAACGGATGG
CTTCATACTG GCGACCTCGG TTATTTAGAT GAAGAAGGAT TTTTATACGT ATTAGATCGC
CGCAGTGATT TAATTATTTC TGGCGGAGAG AATATATATC CGGCTCAAAT TGAAGAAGTG
TTGCTTTCTC ATCCGATGGT AGCGGAAGCT GGTGTTGTCG GTATGACTGA CGATAAATGG
GGACAAGTAC CCGCTGCTTT TGTTGTAAAA AGTGGAGAGA TAACAGAAGA AGAAATTCTT
CATTTTTGCG AGGAGAAATT AGCGAAATAT AAAGTGCCGA AAAAAGCGTG CTTCTTAGAA
GAATTACCAC GAAATGCTTC GAAAAAATTG TTAAGACGAG AGTTAAGACA ATTAGTGGAG
GAGATGTAA
 
Protein sequence
MMETMPNWLK QRAFLTPDRT AIEIEEEKVT FMQLHEKVVS VCEHLTHVGV NRGQKVAVLM 
KNGMEMITVI HALSYVGAVA VLLNTRLSRE ELLWQMDDAE VICLVTDQDF EAKDIPVYSF
AEVMNGPKEE ASIQEEFSLR EAMTIIYTSG TTGKPKGVIL TYGNHWASAV GSSLNLGLRD
DDCWLACMPM FHVGGLSLLM KNIMYGMRIL LVPKYDADFI HKALQTRGVT IISVVSKMLT
DLLERLGEGT YPSSFRCMLL GGGPAPKPLL ETCVDKGIPV YQTYGMTETS SQICTLSADY
MLTKVGSAGK PLFQCQLRIE KDGVVVPPFA EGEIVVKGPN VTGGYFNRED ATRETIQNGW
LHTGDLGYLD EEGFLYVLDR RSDLIISGGE NIYPAQIEEV LLSHPMVAEA GVVGMTDDKW
GQVPAAFVVK SGEITEEEIL HFCEEKLAKY KVPKKACFLE ELPRNASKKL LRRELRQLVE
EM