Gene GBAA_4933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_4933 
Symbol 
ID2821121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp4478800 
End bp4479915 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content41% 
IMG OID637791605 
Productaminopeptidase 
Protein accessionYP_052649 
Protein GI50196961 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2309] Leucyl aminopeptidase (aminopeptidase T) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGATC CACGCATTGA AAAGTTAGCA TACAATTTAA TTAACTACTC TATTCGCTTA 
CAAAAAGGCG AAAAAGTATT AATTGAAAAC TTTGGCTTAC AAAAAGAACT TGTAACTGCA
CTTGTAAAAG AAGCATATGC AGCTGGTGGT TTCCCATTCG TTTCTTTAAA AGATCATCAA
GTAGATCGCT CTTTATTAAT GGGTGCTACT GAAGAACATT TCGAACAAAT CGCAGCGTAT
GAAGCAAGCG TAATGAAAGA TATGGACGCT TATATCGGTC TTCGCTCTGG TGATAACATT
AACGAACAAG CTGACGTACC AAGTGAGAGA ATGCAAATTC ACGGTCAAAC AGTTGGTAAG
AAAGTTCATA GAGACATCCG CGTTCCAAAA ACACGCTGGG TTGTTCTTCG CTACCCAAAT
GCTTCTATGG CACAGCTTGC GAAAATGAGC ACAGAAGCTT TCGAAGACTT CTACTTCGAA
GTATGTAACT TAGACTACGG TAAAATGGAT AAGGCGATGG ATAGCCTTGT TACATTAATG
AATAAAACAG ATAAAGTGCG CCTAACTGGA CCTGGAACTG ACTTAACATT CTCTATTAAA
GACATTCCAG CAATTAAATG CTCAGGTCAT TTAAACATTC CAGACGGTGA AGTATACTCT
GCACCCGTTC GTGATTCTGT TAACGGTACA GTTTCTTACA ACACGCCATC TCCTTACAAC
GGTTATACAT TTGAAAATGT ACAACTTAAG TTCGAGAACG GCCAAATCGT TGAAGCAACT
GCAAACGATA CAGAACGCAT TAACAAAATC TTCGATATAG ACGAAGGCGC ACGCTACGTT
GGTGAGTTCG CAATCGGCGT AAACCCATAC ATCTTGCATC CAATGGGAGA TATCCTATTC
GATGAAAAAA TCGATGGCAG CTTCCACTTC ACTCCTGGAC AAGCTTACGA CGATGCATGG
AACGGTAACA ACTCGAACAT TCACTGGGAT TTAGTATGCA TCCAACGCCC TGAATACGGC
GGCGGTGAAA TTTACTTCGA CGACGTACTA ATCCGTAAAG ACGGACGCTT CGTTGTACCT
GAATTAGAAG CTTTAAATCC AGAGAACTTA AAATAA
 
Protein sequence
MKDPRIEKLA YNLINYSIRL QKGEKVLIEN FGLQKELVTA LVKEAYAAGG FPFVSLKDHQ 
VDRSLLMGAT EEHFEQIAAY EASVMKDMDA YIGLRSGDNI NEQADVPSER MQIHGQTVGK
KVHRDIRVPK TRWVVLRYPN ASMAQLAKMS TEAFEDFYFE VCNLDYGKMD KAMDSLVTLM
NKTDKVRLTG PGTDLTFSIK DIPAIKCSGH LNIPDGEVYS APVRDSVNGT VSYNTPSPYN
GYTFENVQLK FENGQIVEAT ANDTERINKI FDIDEGARYV GEFAIGVNPY ILHPMGDILF
DEKIDGSFHF TPGQAYDDAW NGNNSNIHWD LVCIQRPEYG GGEIYFDDVL IRKDGRFVVP
ELEALNPENL K