Gene BAS1429 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS1429 
Symbol 
ID2852692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp1455501 
End bp1456763 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content34% 
IMG OID637504685 
ProductTPR domain-containing protein 
Protein accessionYP_027698 
Protein GI49184446 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAAGT TTGAACAAGC TGTTTCATAT ATTGAAAATG GTGAAGCGGA AAAAGGATTA 
CAATTGTTAA AAGAACAATT AAAAATTGCG AATGATGAAG AGAAGTATGA TATCGCTCGT
TACTATCATA CACTGGGATT TACGGATGAA GCGTTATCTA TTACAGAAGA CTTACGTTTA
TTGTATCCAG AAGAAAGTGA ATTCACTGTA TTTTTAGCAG AATTATATAT TGATCTAGAC
AAAGAAGATG AAGCGATTGA AGTGCTTCAT GATATTCCAG AAAATGATGA TTTATATGTT
CAATCGTTAT TACTAGTTGC GGATTTATTC CAAATGCAAG GTTTTGATGA TGTAGCAGAA
CAAAAACTAT TAAAGGCGAA AGAAATGATG CCTGACGAAC CTGTCATTAC GTTTGGATTA
GCAGAGTTAT ATAGTAGTAA AGGTGAAGAA CAAAAGGCAA TCACTTATTA TGAGTCGCTA
TTATCGGAAC ATAAAGTAAT GGGTGGTGTT GTCATTGCAC TACGCCTTGG AGAAACGTTA
AGTGCGATTG GAAATTGGGA AGAGGCGATT TCTTACTACG AAGCAGGTTT AGAAGAACAA
AAAGATATCC ACTCATTGTT TGGATATGCC TTCACATTAT ATCAAGGTGA AGAATACCAA
AGAGCAATTG GTGCTTGGCA AGAACTAAAA GAATTAGATC CTGAGTATGC ATCTCTTTAC
ATGTATTTAG CGAAAAGCTA TGAAAAAGAA GGAATGCTTC AAGAAAGCTA TGAAACACTT
CATGAAGGAA TTAAAGTAGA TGAACTTTCT GTACCATTTT ATGTAGAATT AGCGAACATT
GCAGCGAAAT TAGGGAAAAT AGCGGAAGCA GAGGAAGTGC TTCAAAAAGC GCTTGAGTTA
GATCCAGGAC ATTTAGGTGC AACATTAAAA TATGCATATA TCTTAAAGGA ACAAGAAAAG
TATGAAGAGC TAATTGCCGT TGTAGAGCGT GCTATCGATA GTGGAGAACC AGATACACAA
CTACTTTGGG ATCTTGCGTT TGCAAAAAAA CAATTAGAAA TGTATTCTGA TGCATTAAAA
CACTATGAAA GTGCATATAC TTCTTTTAAG AATCATCCAG ACTTCTTGGA AGAGTACGGT
TATTTCTTAT TGGAGGAAGG TATGCAAAAA GAGGCGAAAG AAGTATTTAC TCAGTTAATA
CAACTAGACC CGACACAAAT TCATATTGAA GAATTGTTAT ATAATTTAGA GGATTTTTCA
TAA
 
Protein sequence
MQKFEQAVSY IENGEAEKGL QLLKEQLKIA NDEEKYDIAR YYHTLGFTDE ALSITEDLRL 
LYPEESEFTV FLAELYIDLD KEDEAIEVLH DIPENDDLYV QSLLLVADLF QMQGFDDVAE
QKLLKAKEMM PDEPVITFGL AELYSSKGEE QKAITYYESL LSEHKVMGGV VIALRLGETL
SAIGNWEEAI SYYEAGLEEQ KDIHSLFGYA FTLYQGEEYQ RAIGAWQELK ELDPEYASLY
MYLAKSYEKE GMLQESYETL HEGIKVDELS VPFYVELANI AAKLGKIAEA EEVLQKALEL
DPGHLGATLK YAYILKEQEK YEELIAVVER AIDSGEPDTQ LLWDLAFAKK QLEMYSDALK
HYESAYTSFK NHPDFLEEYG YFLLEEGMQK EAKEVFTQLI QLDPTQIHIE ELLYNLEDFS