Gene BAS3172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS3172 
Symbol 
ID2848828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp3143896 
End bp3145224 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content36% 
IMG OID637506416 
ProductCBS domain-containing protein 
Protein accessionYP_029429 
Protein GI49186177 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.903732 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGAAATAT TTAATTTAGT CATGGTTGCG ATTTTAATCG CATTTACTGG ATTTTTCGTA 
GCAGCTGAGT TTGCGATTGT AAAAGTACGT TCAAGTCGTA TTGATCAGCT TGTTGCAGAA
GGAAAACGCG GTGCTTTAGC AGCGAAAAAG GTAACAACAA ATTTAGATGA ATATTTATCT
GCTTGTCAAT TAGGTATTAC AGTTACAGCT ATGGGATTAG GTTGGTTAGG TGAACCGACA
ATTGAAAAGT TATTACACCC GTTATTTGAG AAATGGAACT TAAACCCTTC TATTTCATCA
GTATTAACAT TTGGTCTTGC TTTTATGTTA ATGACGTATT TACACGTTGT AGTAGGGGAA
TTAGCTCCTA AAACGATGGC AATTCAAAAG GCTGAAAAAG TAACATTATT ATTTGCAGCT
CCACTAATGA TGTTCTATAA AGTGATGTAT CCATTTATTT GGGTATTAAA TGGTTCAGCT
CGTGTGATAA CTGGTTTATT CGGTTTAAAA CCGGCTTCTG AACATGAAGT AGCTCATACA
GAGGAAGAAT TACGCCTTAT TCTTTCAGAT AGCTATGAAA GTGGCGAAAT TAATCAAGCT
GAATACAAGT ATGTAAATAA CATTTTTGAA TTTGATAATC GTATTGCAAA AGAGATTATG
GTACCGCGAA CAGAAATCGT TGGTTTCTAC CTGGAAGATT CAGTAGAAGA ACACATGAAA
GTAATCCAAA ATGAGCGATA CACACGTTAT CCGATTTTTG GAGAAGATAA AGATGATATT
ATCGGTATGG TCAACGTAAA AGATTTCTTT ATTCGATATA TGACCGAGGA TCAAAAAGAT
TTATCATCCA TTCGCTCGTA TATGCGTCCG ATTATTGAAG TGATGGAAAC AACTCCAATT
CACGATTTAT TACTTCAAAT GCAGAAGAAG CGAATTCCGA TGGCTGTTTT ATATGATGAG
TACGGAGGAA CAGCTGGAAT TGTAACGTTT GAGGACATCT TGGAGGAAAT CGTCGGCGAA
ATTCGTGATG AATATGATGA AGATGAAGCA CCACCAATTC AACATGTGAA CGAGCAACAT
ATCATTGTTG ATGGAAAAGT GCTTATCTCA GAAGTGAAAG ATTTATTTGG ATTACACATT
GAAGAAGATG ATGTGGATAC AATCGGTGGA TGGATTATGA TGCAAAATCA TGAAATCGAA
GAAGGACAAC ACGTTGAGGC GGAAGGTTAT GAATTTAAAG TGTTAGAAAA AGACGCTTAC
CAAATTAAAC GTGTTGAAAT TCGTAAGATG GAAGAGGAAC AAGAAGAAGA AAAAGCAGCA
ACTGTGTAA
 
Protein sequence
MEIFNLVMVA ILIAFTGFFV AAEFAIVKVR SSRIDQLVAE GKRGALAAKK VTTNLDEYLS 
ACQLGITVTA MGLGWLGEPT IEKLLHPLFE KWNLNPSISS VLTFGLAFML MTYLHVVVGE
LAPKTMAIQK AEKVTLLFAA PLMMFYKVMY PFIWVLNGSA RVITGLFGLK PASEHEVAHT
EEELRLILSD SYESGEINQA EYKYVNNIFE FDNRIAKEIM VPRTEIVGFY LEDSVEEHMK
VIQNERYTRY PIFGEDKDDI IGMVNVKDFF IRYMTEDQKD LSSIRSYMRP IIEVMETTPI
HDLLLQMQKK RIPMAVLYDE YGGTAGIVTF EDILEEIVGE IRDEYDEDEA PPIQHVNEQH
IIVDGKVLIS EVKDLFGLHI EEDDVDTIGG WIMMQNHEIE EGQHVEAEGY EFKVLEKDAY
QIKRVEIRKM EEEQEEEKAA TV