Gene BAS2003 details


Gene Information

Locus tag: BAS2003
Symbol:
ID: 2848216
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 2009816
End bp: 2011417
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 35%
IMG OID: 637505253
Product: hypothetical protein
Protein accession: YP_028266
Protein GI: 49185014
COG category: [R] General function prediction only
COG ID: [COG1075] Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold
TIGRFAM ID:
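
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminating stop codon (neither is stated in the record). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one untranslated stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2009816, 2011417)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch gives 1602 bp and 533 aa, which agrees with the Gene Length and Protein Length fields.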


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGATTGA TGAGAAGATG TGTTGCCTTG CTGATTGTAT TTTTTATTAT GGCTCCAATG 
ATTAGCACAA ATGTGCGAGC AGAAGTTGTA AAAGAGCTTA AAACAGGTTT TCCAGATCAA
GAAGTCTTTA CACCTGGAGA ATGGTTTTTA GGCCAAAAGC CTGCTAATTA TGATGAAAGT
AAGCCTCCAA TTCTCTTTGT GCAAGGAAGG AATGGGAATG CTAATAGTTG GTATGGAAAG
ACAGTGTATC ACGATATAAA TGATATGTAT GACTATGCTT TGAAAGCAGG CTATCAAACA
GTATTTATAC AATTATATGA TGCTGCTGGG AAAGGGTCAG CTAGTCAGTG GGATAACGGA
AAATTGTTAG CACAAAAATT AGAAGAAATA TATAATCATT TCGGTAAAAA AGTTAATATT
GTAGCACATA GTAAAGGTGG TATCGATACA CAAGCTGCAT TAGTAGGATA TGGGGCAAAT
CAATTTGTCG GAAATGTTAT TACACTTGCG ACGCCACATC ATGGATCAAA CTTAGCGGAT
TTATCATATA GTTGGTGGGC AGGATGGCTT GCTTCTATAT TAGGTCAAAA AGATGATGGT
ACGTATTCAT TACAAATGGG CGAAATGGCA AAATTTCGTT CAACGATTGA TAATAATCCA
GCAGCTAAAT TAAACCGTTA TTATACAGTT ACTGGGACTA GCTGGGGCCC TGTATTTTCG
GCGTTATCAA TGGGTGGATT ATATTTGTCA TCATACGGTT CAAATGATGG ACTAGTAAAT
GAGTGGAGTG CTAAGCTGCC GTATGGAACG CATTTATTTA CAGATTCTAG ATTTGATCAT
GATAATATAC GAAAAGGATC TGCTGTTTTC TCACGAATTG AACCATATTT ACGTACAGCA
AATGTACCAG CTCCAGCTTT AGTAGCATCC AGCACTAGTT CAAATGAAAA TATAGAACAA
TTAAATACAA CTTCAAATCA AAATATTTTA GGAGGGGAAT TGCCACAAAA TCAGTGGATA
GAGCAAACCG TTACGGTTGA TAAAAAGGCA GAAGGAATTG TTTCTATATT AACTGCTTCG
TCGGATGTAG AAGTACAAAT GATATCACCA AAAGGAAAAG TGTATACAAA TAAGGATAGT
GTTATAACTA CTGGTGAAGA TGAATCTTTC TTTGGTGGCG CGACAATTAA AACATTTAAA
TTTGATAAAA TGGATGTAGG AGAATGGAAA GTTAAAATGA TGACGAAGCA GTCGAAAGAT
GCATATTTAG TTGTAAGCGA TTACAAAACT GGCGCACCAT TTGTTCTTCA AATGCCTACA
AAAGTAAAAG CAAATAAATC TGAGTATAAA CTGAAAAAAT CACCTGTGGC ACCTGAAATG
AAAGGAAATC TTTCCATAAC AGTAAGAGTC GTGAATAAAG AAGGGAAACT AGTCTCTGAA
TATAATGAAT TACAAAATGT GAATAACAAT ACATTTACTG GTGCTTTGAA GGACATAAAG
CAACCAGGAG TATATAACGT TACGATGGAT ATAAAAGGGA TGAATAAAGA AGGACAACCA
TATAATCGTA CAATTGTTAA GTCGGTTTAT GTGGAGAAAT AA
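
The GC content field in the gene information can in principle be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full gene length (the record does not say how the value was derived or rounded). The function name is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Works directly on the blocked layout used in this record.
print(gc_percent("ATGC ATGC\nGGCC AATT"))
```

Passing the full gene sequence, including its ten-base blocks and line breaks, should return a value that can be compared with the reported figure.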
 
Protein sequence
MRLMRRCVAL LIVFFIMAPM ISTNVRAEVV KELKTGFPDQ EVFTPGEWFL GQKPANYDES 
KPPILFVQGR NGNANSWYGK TVYHDINDMY DYALKAGYQT VFIQLYDAAG KGSASQWDNG
KLLAQKLEEI YNHFGKKVNI VAHSKGGIDT QAALVGYGAN QFVGNVITLA TPHHGSNLAD
LSYSWWAGWL ASILGQKDDG TYSLQMGEMA KFRSTIDNNP AAKLNRYYTV TGTSWGPVFS
ALSMGGLYLS SYGSNDGLVN EWSAKLPYGT HLFTDSRFDH DNIRKGSAVF SRIEPYLRTA
NVPAPALVAS STSSNENIEQ LNTTSNQNIL GGELPQNQWI EQTVTVDKKA EGIVSILTAS
SDVEVQMISP KGKVYTNKDS VITTGEDESF FGGATIKTFK FDKMDVGEWK VKMMTKQSKD
AYLVVSDYKT GAPFVLQMPT KVKANKSEYK LKKSPVAPEM KGNLSITVRV VNKEGKLVSE
YNELQNVNNN TFTGALKDIK QPGVYNVTMD IKGMNKEGQP YNRTIVKSVY VEK
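
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code for elongation and differs only in the permitted start codons; it does not model alternative start codons, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, then third base (T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for pos in range(0, len(bases) - len(bases) % 3, 3):
        residue = CODON_TABLE[bases[pos:pos + 3]]  # KeyError on ambiguous bases
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate_cds("ATGAGATTGA TGAGAAGATG TGTTGCCTTG"))
```

For the first 30 bases this prints MRLMRRCVAL, the first block of the protein sequence above.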