Gene BAS3521 details


Gene Information

Locus tag: BAS3521
Symbol:
ID: 2851817
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 3487586
End bp: 3488725
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 33%
IMG OID: 637506762
Product: hypothetical protein
Protein accession: YP_029775
Protein GI: 49186523
COG category:
COG ID:
TIGRFAM ID:
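
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon; the function names are illustrative and are not part of the database.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 3487586
END_BP = 3488725


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues in a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1140 379
```

Under these assumptions the computed values agree with the listed Gene Length (1140 bp) and Protein Length (379 aa).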


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000274826
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATCAA AAATGGAAGT AACAGCTATT CAGTTTAAAA AGATGGAAGA TCCATTAGGG 
AGAAGTACAA AAAAGAAGTA TATTTGCTAT GTAAATGTAA ATGATGTACC GAAGGATATT
CCAATGGCAA CAAATCCACG AGAGCAAAAA TTAACTAAAA GTGTCCCAAA GCAAATTGAA
GATTCATTAC TATCTGATGA TGGTGAGTTT CATTTGAAAA ATCGTGGGAT TGTAATTTCT
GCTAAAAAAG TAGAATATAA TACTCAAACA AAGAAAATGA CATTACTTTT TGATGATTTC
TATGAGCATG GAAACATTGA TGGTGGACAT ACTTATAAAG TAATTTTGAA GCATCAAGGA
AAAGGGCTTC AACAATATGT TCAATTTGAA ATCATGGTAG GTGTAGAAGA TATAATTGAA
CCATTAGCAG CTGCTAGAAA TACATCTACA CAAGTAGATG AAAAATCTAT AGCTGAGTTA
GAAGGGAAAT TTGCACCTAT CAAAGATTCT ATTGGAGGTA TGCCTTTCTA TAATAGGGTG
GCATTTAAAC AAAATCAGCA TTCAGATCGA AGAGGAGTAA AGGTAATAGA TGCACGTGAG
ATTGTAGCAA TTATGACTAT GTTTAATATA GACAGTTATG GACCAGATAC TCATCCAACT
GCTTCGTATT CTAGTAAAGC AAAAGTACTC TCAGAGTATT TAAAGGACCA ATCTGAGTTT
GAAAAAATGC ATAATATTGC TCCAGACATG TTTGATTTAT ATAGCAAGAT TGAAATGGAT
TTTCCTGTTG CATATAATGC AACAGGTGGA AAATATGGGG CGAAGAAATT TTCTGGCTAT
AAAGAGGGGA ATGTAGTAGC CAAGTTGAAG TTTGGGGACG AGCCTTTAGA ATATAAAGTA
CCGGATGGTT TAGTGTATCC TATTTTAAGT GCTTTTAGAG CTTTAGTAAC TTTAGATGAA
AAAACTAATA TGTATCGATG GGTTAAAGAT CCTTTTGATG TATATGAAGA GATAAGAGTG
CAATTGGCAA GTAAAATTAT GAAATTTACA GAGTCCATTG GTAACAATCC TAATGCCGTA
GGGAAAGATA CAAACGCCTG GGATATGATG TATATGACAG TTGAACGTTA TGTAAAATAA
 
Protein sequence
MKSKMEVTAI QFKKMEDPLG RSTKKKYICY VNVNDVPKDI PMATNPREQK LTKSVPKQIE 
DSLLSDDGEF HLKNRGIVIS AKKVEYNTQT KKMTLLFDDF YEHGNIDGGH TYKVILKHQG
KGLQQYVQFE IMVGVEDIIE PLAAARNTST QVDEKSIAEL EGKFAPIKDS IGGMPFYNRV
AFKQNQHSDR RGVKVIDARE IVAIMTMFNI DSYGPDTHPT ASYSSKAKVL SEYLKDQSEF
EKMHNIAPDM FDLYSKIEMD FPVAYNATGG KYGAKKFSGY KEGNVVAKLK FGDEPLEYKV
PDGLVYPILS AFRALVTLDE KTNMYRWVKD PFDVYEEIRV QLASKIMKFT ESIGNNPNAV
GKDTNAWDMM YMTVERYVK
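
The gene and protein sequences above are printed in blocks of ten characters, so they need to be cleaned before use. The sketch below shows one way to strip the spacing, compute GC content, and translate the DNA to protein. It assumes the codon assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The function names are illustrative, and only the first line of the gene sequence is used as sample input.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base varies slowest, T-C-A-G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = "ATGAAATCAA AAATGGAAGT AACAGCTATT CAGTTTAAAA AGATGGAAGA TCCATTAGGG"
dna = clean(first_line)
print(translate(dna))
print(round(gc_content(dna), 1))
```

Passing the full 1140 bp gene sequence through `clean` and `translate` should give a protein that can be compared with the listed protein sequence, and `gc_content` can be compared with the reported GC content.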