Gene BAS4849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4849 
Symbol 
ID2849230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4733956 
End bp4735176 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content38% 
IMG OID637508107 
Productaminotransferase, class V 
Protein accessionYP_031092 
Protein GI49187839 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATTC ATGAAATACG CAAACAGTTT CCAATTCTTG ATCAAAAAGT GAACGGCAAA 
CAACTTGTTT ATTTCGATAG TGCAGCAACT TCTCAAAAAC CAATTCAAGT CATTGAAACG
TTAGAACGTT ACTATAAAGA ATATAATTCT AACGTGCATC GCGGTGTTCA TACGCTCGGT
ACGAAAGCTA CCGATGCGTA TGAAGGTGCA CGTGAGAAAG TTCGCAAGTT TATTAATGCG
AAATCAATGG AAGAGATTAT TTTCACACGC GGAACGACAA CTGCATTAAA TACAGTAGCG
GCTAGTTATG GTCTTGAAAA TGTAAAAGAA GGCGATGAAA TCGTTATTTC TTACATGGAG
CATCATAGTA ACATCATTCC GTGGCAACAA GTTGCGAAGA AAACTGGTGC AACTTTAAAA
TATCTTCCGC TTCAACCAGA TGGGACAATT TCAATAGAAG ATGCTCGTCA AACAATTACA
CCGAATACAA AAATCGTTTC TATCATGTAT GTATCTAACG TACTTGGAAC GATTAACCCT
GTAAAAGAAA TCGGAGCAAT CGCACACGAA AACGGTGCAA TTATGGTCGT TGACGGTGCA
CAAAGTACAC CTCATATGAA AGTGGATGTA CAAGATTTAA ATTGTGATTT CTACGCATTA
TCTGCTCATA AGATGTGCGG ACCTACAGGT ATCGGCGTAT TATATGGTAA GAAAGAATTG
CTAAACAATA TGGAGCCAAT TGAATTTGGC GGTGAAATGA TTGATTTCGT AGATTTACAA
GAATCTACTT GGAAAGAGCT TCCGTGGAAG TTTGAAGCAG GTACGCCGAT TATCGGTAAT
GCAATCGGAC TTGGTGCGGC AATTGATTTC CTAGAAGAAA TCGGTCTTGA TAATATTGAA
AAGCATGAGC ATGAATTAGC GCAATACGCT TTAGAAAGAC TATCAGAAGT AGATGGCGTT
ACAATTTATG GTCCAAAGCA TCGCGCTGGT CTTGTTACAT TTAATATTGA AGATGTACAT
CCTCACGATG TAGCGACAGT ATTAGATGTA GAAGGTATCG CGGTTCGCGC AGGACACCAC
TGTGCACAAC CGCTTATGAA GTGGCTGAAA GCTTCTTCAA CAGCACGTGC GAGCTTCTAT
TTATATAATA CAAAAGAAGA AATTGATACA TTTGTTGAAT CGCTAATCAA GACAAAGGAG
TATTTCACAA ATGTCATTTA A
 
Protein sequence
MNIHEIRKQF PILDQKVNGK QLVYFDSAAT SQKPIQVIET LERYYKEYNS NVHRGVHTLG 
TKATDAYEGA REKVRKFINA KSMEEIIFTR GTTTALNTVA ASYGLENVKE GDEIVISYME
HHSNIIPWQQ VAKKTGATLK YLPLQPDGTI SIEDARQTIT PNTKIVSIMY VSNVLGTINP
VKEIGAIAHE NGAIMVVDGA QSTPHMKVDV QDLNCDFYAL SAHKMCGPTG IGVLYGKKEL
LNNMEPIEFG GEMIDFVDLQ ESTWKELPWK FEAGTPIIGN AIGLGAAIDF LEEIGLDNIE
KHEHELAQYA LERLSEVDGV TIYGPKHRAG LVTFNIEDVH PHDVATVLDV EGIAVRAGHH
CAQPLMKWLK ASSTARASFY LYNTKEEIDT FVESLIKTKE YFTNVI