Gene BAS1973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS1973 
Symbol 
ID2851440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp1977846 
End bp1979060 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content36% 
IMG OID637505223 
Producthypothetical protein 
Protein accessionYP_028236 
Protein GI49184984 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGCA CGCTACGAGC ACTATATCAC GTTACGAAGT TATTACTAAT CCCTTTATGC 
CTAGTGTTAA CATTTGTGTA TGCTATGCTT CAAGGAGGAT TTGTAAGTTG GTTTTTATTT
TACAGTATGG TTCCTATTGG TCTTTATTCA CTTTTACTCC CCTTCTACGC TTTACGGGGT
GCTGAAGTAA AAAGAATAAC AAATCAAAAC GATTATGTAG CAGGAGAACG ATTTGTAAGC
ACGATTACAA TAAAAAGAAA ATTTCCTTTC CCCTTACTTT ATTTAGTTAT AGAAGATGAA
CTGCCACCAC ACCTTACAAG TTGTAGACAA ACAAAGATGA ATAAAACAAT ACTCTTTCCA
GGATTAAAAC GAAATATTTC GTTTCAATAT GCAATTGACA CAATCCCTAG AGGAGAGCAC
ACTTTTTCAA GCGTACGAGT CAAAACTGGT GATCTATTCA GTATGATGGA GAAAGAAGTA
ACTTTTTCAG TTCCGGATAC ATTTTTAATC TATCCTCAGT ATGTAGATAT AACGTATCAG
CAATTGGAAA ACCATTTCGA ACAAGGAGCG CTCTCAGCAA ATATAAATTT CACAAAAGAC
TCTACCATTT CTGTCGGTTT GAGAGACTAT AAACCTGGTG ACCGCTTTTC ATGGATTGAT
TGGAAAGCAA CTGCAAGAAC AAACAACATC ATGACGAAAG AGTTTGAACA ACAGCGTAGC
CATAATATTA TGATATTCAT AGACAGAACT GAGTCCCCTC TATTCGAATC AGTCGTCACA
TTTACTGCCT CTATTGTCAG GGCTGTATTG AAGCAAAATT CACCAGCGTC ATTTGTGTCT
GTGGGAAAAG AACGAACTTT TTTCCCTTTA GACAATGGAG ATAGTCAGTT GCAGCAAATC
TTTTGTCATT TAGCGAAAGT ACAAGCGGAC AGTGTATTCC CGCTCTCCCA GAGTGTAGAA
ATGGAATTAA GAAAAGTTTA TGAGCCCGTA ACAATTATAC TTGTGACAAG CGATCTTTCT
CCCGATATTC AAAAGGCGGC TGACTATACC GCTATACAAA ATAGAAAATT ACTAGTTTTT
ATTGTAAAAG AAAAACCAAA TCAACTCTCA CATCGAGAAC TAAGTATTTT AGAAACTCTA
CAAAAACGAA AAATATTTGT AAATGTAGTT TATGGAAACC AGTATACAAA CGTGTTTTTT
GAGGTGAGCA AATGA
 
Protein sequence
MKRTLRALYH VTKLLLIPLC LVLTFVYAML QGGFVSWFLF YSMVPIGLYS LLLPFYALRG 
AEVKRITNQN DYVAGERFVS TITIKRKFPF PLLYLVIEDE LPPHLTSCRQ TKMNKTILFP
GLKRNISFQY AIDTIPRGEH TFSSVRVKTG DLFSMMEKEV TFSVPDTFLI YPQYVDITYQ
QLENHFEQGA LSANINFTKD STISVGLRDY KPGDRFSWID WKATARTNNI MTKEFEQQRS
HNIMIFIDRT ESPLFESVVT FTASIVRAVL KQNSPASFVS VGKERTFFPL DNGDSQLQQI
FCHLAKVQAD SVFPLSQSVE MELRKVYEPV TIILVTSDLS PDIQKAADYT AIQNRKLLVF
IVKEKPNQLS HRELSILETL QKRKIFVNVV YGNQYTNVFF EVSK