Gene BAS4159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4159 
Symbol 
ID2851289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4076096 
End bp4077259 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content39% 
IMG OID637507395 
Productcystathionine beta-lyase 
Protein accessionYP_030408 
Protein GI49187156 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTATT CTATAGATAC ACTCTTACTA CACAACCAAT ATAAACATGA TCCACAAACA 
GGAGCTGTTA ACGTTCCCAT CTATAACACA TCAACATTCC ACCAGTTCGA TGTAGATACG
TTCGGGAAAT ATGACTATAG CAGGTCAGGA AATCCAACTC GTGAAGCTCT TGAAGACATC
ATTGCTTTAT TAGAAGGCGG AACGAAAGGA TTTGCCTTCG CATCAGGAAT TGCAGCGATT
TCTACTGCAT TCCTCCTTCT TTCACAAGGT GATCACGTAC TCATTTCAGA AGACGTATAC
GGAGGGACTT ATCGAATAAT AACTGAAGTT CTCTCCCGTT ATGGTGTTTC ACATACATTT
GTTGATATGA CCAATCTAGA AGAAATCAAG CAAAATATAA AACAAAATAC AAAGCTCTTT
TATGTAGAAA CACCTTCTAA CCCGCTTTTA AAAGTAACAG ATATTCGTGC TGTTTCTACA
CTTGCAAAAT CTATTGGCGC TCTTACTTTT GTTGATAATA CATTTTTGAC ACCACTATTC
CAGAAACCAC TTGATCTCGG CGCAGATGTC GTTCTTCATA GCGCTACAAA GTTCATTGCT
GGTCACAGTG ATGTTACTGC TGGATTAGCG GTCGTAAAAG ATGCCGAACT TGCTCAAAAA
CTTGGATTTT TACAAAATGC ATTCGGCGCC ATTTTAGGAC CTCAAGATTG CTCTCTCGTA
CTTCGCGGTC TAAAAACATT ACATGTACGT CTTGAGCATT CAGCTGCGAA TGCCAATAAA
ATTGCACAGT ATTTACAAGA GCACAGTAAA ATTCAAAATG TCTATTATCC TGGCTTACAA
ACACATCTTG GATTTGATAT TCAACAATCT CAAGCAACAT CGGCCGGAGC TGTCCTATCC
TTCACTTTAC AATCAGAAGA TGCACTCCGC CAATTTTTAT CAAAAGTAAA ATTACCTGTC
TTTGCAGTTA GTTTAGGAGC TGTCGAATCG ATTCTTTCCT ATCCGGCTAA AATGTCACAT
GCAGCACTGT CACAAGAAGC TCGTGATGAA AGAGGTATTT CCAATTCATT ACTTCGTTTA
TCCGTCGGCC TTGAAAATGT TGATGATTTA ATATCCGACT TTGAAAATGC CCTTTCTTAT
GTAGAAGAAC CTGTAAATGC ATAG
 
Protein sequence
MSYSIDTLLL HNQYKHDPQT GAVNVPIYNT STFHQFDVDT FGKYDYSRSG NPTREALEDI 
IALLEGGTKG FAFASGIAAI STAFLLLSQG DHVLISEDVY GGTYRIITEV LSRYGVSHTF
VDMTNLEEIK QNIKQNTKLF YVETPSNPLL KVTDIRAVST LAKSIGALTF VDNTFLTPLF
QKPLDLGADV VLHSATKFIA GHSDVTAGLA VVKDAELAQK LGFLQNAFGA ILGPQDCSLV
LRGLKTLHVR LEHSAANANK IAQYLQEHSK IQNVYYPGLQ THLGFDIQQS QATSAGAVLS
FTLQSEDALR QFLSKVKLPV FAVSLGAVES ILSYPAKMSH AALSQEARDE RGISNSLLRL
SVGLENVDDL ISDFENALSY VEEPVNA