Gene BAS4505 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4505 
Symbol 
ID2850363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4420554 
End bp4421867 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content37% 
IMG OID637507743 
Productthioesterase family protein 
Protein accessionYP_030753 
Protein GI49187500 
COG category[K] Transcription 
COG ID[COG4109] Predicted transcriptional regulator containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCTACCA AGCATAACCA AATTTTAGAA CATATTAATA GCCTGCCAGT AGGGCATAAA 
ATTTCTGTAA GGCAAATTGC AAAAGATTTG AGTGTAAGTG AAGGGACAGC TTATCGTGCA
ATTAAAGATG CAGAAAATAA AGGGTATGTT AGTACGATTG AACGTGTCGG AACAATTCGA
ATTGAACAAA AGAAGAAAGA AAATATTGAA AAGCTGACAT ATGCAGAAGT CGTTAACATT
GTTGATGGTC AAGTACTTGG AGGCAGAGAA GGACTACATA AAACGTTAAA TAAATTCGTA
ATCGGCGCAA TGAAATTAGA AGCGATGATG CGCTATACAG AAGCTGGAAA CTTACTTATT
GTCGGTAATC GTACGAACGC ACATCAATTA GCGTTAGAAA CCGGAGCTGC GGTGTTAATT
ACGGGCGGAT TTGATACGGA AGATCATGTG AAGAAATTAG CAGATGAATT AAAACTGCCG
ATTATTTCAA GTAGTTACGA TACATTTACA GTTGCAACGT TAATTAACCG TGCGATTTAC
GATCAGCTTA TTAAGAAAGA AATTGTACTC GTTGAAGATA TTTTAACGCC AATTGAAGAG
ACGTTATATT TAAAACCAAA TGATACAGTG CAGCAATGGC ATGCATATAA CGAAGAGACG
ATGCACGGAA GGTATCCAAT TGTTGATGAA AATAAAAAGG TATTAGGTAT TGTGACTTCT
AAGGATATGA TTGGTGTTGC AAAAGAAACA CCAATTGATA AGGTAATGAC AAGGCATCCA
ATTACGGTAA ATGGCAAAAT GTCTGTCGCA GCTGCGGCAC GTATGATGGT GTGGGAAGGT
ATTGAGTTAC TTCCTGTTGT TGATGAAGGA AATAAGTTGC AAGGTATTAT TAGCCGTCAA
GATGTACTTC AGGCGTTGCA AATGATTCAG CGTCAACCGC AAGTAGGCGA AACAATCGAT
GATATTGTAA CGAATCAATT TATGACGCCG AAAGAAGCGA AAAATGAGCA TTTATATCAA
TTTTCAGTGA CGCCGCAAAT GACGAATTCA ATCGGAACGC TATCTTACGG TGTATTCGCA
ACGATTGTGA CAGAAGCAAC GAATCGCGTT ATTCGTGCGC AAAAGAAGAG TGATTTAATT
GTTGAGAACT TAACAATTTA TTTCGTAAAA CCAGTTCAAA TTGACAATGT TGTATCGGTT
CATCCGAAAG TATTAGAAAT TGGACGTAAA TTTGGTAAGG TTGATGTAGA GGTGCATCAT
GAAGGTAATG TTGTTGGAAA AGCATTACTT ATGGTGCAGT TAATTGATAA ATAA
 
Protein sequence
MATKHNQILE HINSLPVGHK ISVRQIAKDL SVSEGTAYRA IKDAENKGYV STIERVGTIR 
IEQKKKENIE KLTYAEVVNI VDGQVLGGRE GLHKTLNKFV IGAMKLEAMM RYTEAGNLLI
VGNRTNAHQL ALETGAAVLI TGGFDTEDHV KKLADELKLP IISSSYDTFT VATLINRAIY
DQLIKKEIVL VEDILTPIEE TLYLKPNDTV QQWHAYNEET MHGRYPIVDE NKKVLGIVTS
KDMIGVAKET PIDKVMTRHP ITVNGKMSVA AAARMMVWEG IELLPVVDEG NKLQGIISRQ
DVLQALQMIQ RQPQVGETID DIVTNQFMTP KEAKNEHLYQ FSVTPQMTNS IGTLSYGVFA
TIVTEATNRV IRAQKKSDLI VENLTIYFVK PVQIDNVVSV HPKVLEIGRK FGKVDVEVHH
EGNVVGKALL MVQLIDK