Gene GBAA_3898 details

Gene Information

Locus tag: GBAA_3898
Symbol:
ID: 2814854
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. 'Ames Ancestor'
Kingdom: Bacteria
Replicon accession: NC_007530
Strand:
Start bp: 3569637
End bp: 3570623
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 38%
IMG OID: 637790616
Product: choloylglycine hydrolase family protein
Protein accession: YP_020536
Protein GI: 47529187
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3049] Penicillin V acylase and related amidases
TIGRFAM ID:
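
The start, end, gene length, and protein length listed above are mutually consistent. The short Python check below is only a sketch: it assumes the coordinates are 1-based and inclusive at both ends, and that the protein length excludes the terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3569637
end_bp = 3570623

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 987 328
```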


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGTACTA GTTTGACATT AGAGACAAAA AACGGTCAGC ATCTTTTTGC AAGAACGATG 
GACTTCACAT TAGATATGAA TCAAGAAGTA ATAATCATTC CTCGACATTA CCAGTGGAAT
AATATAACGG GTGAAATCAT TAATACGAAA CATGCTACGG TCGGAATGGG TATTAATCAT
CAAGGAAGGA TCATTATGGC GGACGGAGTA AATGAAGCAG GTATGACATG TGCAACACTC
TATTTTCCAG GATTCGCTAC TTATAGTCAA AGCATAGATG ACAACACAAC GAATTTGGCT
CCATTTGATT TTGTAACTTG GAGTCTGACA CAATTCAATT CTGTCAAAGA GTTAAAGAAA
TCTGTAGATA GCATTACCTT TTTGGATATA CCATTACCGG ATTTAGGACT TACGCCACCA
CTACATTGGA TTTTAGCGGA TAAATGGGGA GATTGCATTG TACTGGATCC GACAAGTGAA
GGATTAAAAT TGTATGATAA CCCACTAGGA GTGATGACGA ATAGTCCGGA GTTTAATTGG
CATTTACAAA ATTTAAGACA ATATATAGGC CTTAAATCGC AGCCATTCGC GCCAACAGAG
TGGAGTAATT TACCATTAAG TGCTTTTGGC CAAGGCTCGG GCTCAATGGG ACTTCCAGGG
GATTTCACCC CGCCATCGAG GTTTGTGCGG GCAGCATATG GCAAACAAAA CATTCAAGGT
ATAGATAGCG AAGAAGAGGG AGTATCAGCC CTTTTTCATA TCTTATCAAA TTGTGAGGTT
CCTAAAGGTG GAGTAATAAC AGAAGAAGGT GCATTAGATA ATACCATATA TACAAGCGTA
ATGTGTATGG AATCCGGAAC ATATTATTAT CATACTTACG ATTGTAGACA AATTATAGCT
GTTCATTTAT TTCATGAAAA TTTAGATACA GATGAGATTA AAGCCTATCC GTTCCAACGG
AAACAAAAAA TATTTTATGA GAACTAA
 
Protein sequence
MCTSLTLETK NGQHLFARTM DFTLDMNQEV IIIPRHYQWN NITGEIINTK HATVGMGINH 
QGRIIMADGV NEAGMTCATL YFPGFATYSQ SIDDNTTNLA PFDFVTWSLT QFNSVKELKK
SVDSITFLDI PLPDLGLTPP LHWILADKWG DCIVLDPTSE GLKLYDNPLG VMTNSPEFNW
HLQNLRQYIG LKSQPFAPTE WSNLPLSAFG QGSGSMGLPG DFTPPSRFVR AAYGKQNIQG
IDSEEEGVSA LFHILSNCEV PKGGVITEEG ALDNTIYTSV MCMESGTYYY HTYDCRQIIA
VHLFHENLDT DEIKAYPFQR KQKIFYEN
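
The sequences above are laid out in space-separated blocks of ten residues, so they need to be cleaned before they can be measured. The Python sketch below shows one possible way to strip that layout and compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example runs only on the first line of the gene sequence, not on the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence above, used as a small illustrative input.
first_line = "ATGTGTACTA GTTTGACATT AGAGACAAAA AACGGTCAGC ATCTTTTTGC AAGAACGATG"
dna = clean_sequence(first_line)

print(len(dna), round(gc_percent(dna), 1))
```

Pasting the full gene sequence into `clean_sequence` in the same way should allow the listed gene length and GC content to be checked against the record.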