Gene GBAA_3957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_3957 
SymbolproS-2 
ID2819521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp3633943 
End bp3635643 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content39% 
IMG OID637790671 
Productprolyl-tRNA synthetase 
Protein accessionYP_020596 
Protein GI47529247 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0442] Prolyl-tRNA synthetase 
TIGRFAM ID[TIGR00409] prolyl-tRNA synthetase, family II 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAAA GTATGGTATT CAGTCCTACA TTACGTGAAG TTCCAGCTGA TGCGGAGATT 
AAGAGTCATC AGTTATTACT TCGTGCAGGT TTTATGCGTC AAAATGCTTC TGGTATTTAT
AGTTTTCTAC CATTTGGATT AAAAGTACTA CACAAAGTAG AACGTATCGT TCGAGAAGAG
ATGGAGCGCG CAGGTGCCGT AGAATTATTA ATGCCAGCGA TGCAAGCTGC AGAATTATGG
CAAGAGTCAG GTCGTTGGTA TTCTTACGGA TCTGAATTAA TGCGTATGAA AGATCGTAAC
GCTCGTGAAT TTGCATTAGG AGCGACACAT GAAGAAGTAA TTACAGATCT TGTACGTGAT
GAAGTGAAAT CGTATAAAAA ATTACCGTTA ACATTATATC AAATTCAAAC AAAATTCCGT
GATGAACAAA GACCTCGTTT CGGTTTATTA CGTGGAAGAG AGTTTCTAAT GAAAGATGCA
TACTCTTTCC ATGCTACGCA AGAGAGCTTA GATGAAGTGT ACGATCGCTT ATACAAAGCA
TACTCTAACA TCTTTGCTCG TTGTGGCTTG AATTTCCGTG CGGTTATTGC TGATTCTGGA
GCAATGGGTG GAAAAGATAC ACATGAATTT ATGGTATTAT CTGATGTTGG TGAAGATACA
ATTGCATACT CTGATACATC TGATTACGCA GCGAACATCG AAATGGCTCC TGTTGTAGCT
ACGTATACGA AGAGTGACGA AGCAGAAAAA GAGCTTGAAA AAGTAGCAAC ACCAGACCAA
AAAGCAATTG AAGAAGTATC TGCATTCTTA AACATCGAAG CTGACAAGTG CATTAAGTCT
ATGGTATTTA AAGTAGATGA GAAATTAGTA GTGGTACTTG TTCGTGGTGA TCATGAAGTA
AACGATGTAA AAGTGAAAAA TGTATACGGT GCTTCAGTTG TTGAGCTTGC CTCTCATGAA
GAAGTAAAAG AATTATTAAA TTGTGAAGTT GGTTCATTAG GACCGATTGG TGTAAATGGT
GATATCGAAA TTATCGCTGA TCACGCTGTA GCATCAATTG TCAACGGATG TTCAGGAGCG
AACGAAGAAG GATTCCATTA TGTAAATGTA AATCCAGAAC GTGACTTTAA AGTAAGTCAA
TATACGGATT TACGCTTCAT TCAAGAAGGA GACCAATCTC CAGACGGAAA CGGGACAATT
CTTTTCGCAC GCGGAATTGA AGTTGGTCAT GTATTCAAAT TAGGAACTCG TTATAGTGAA
GCAATGAACG CAACATTCCT AGATGAAAAC GGAAAAACAC AACCACTTAT TATGGGTTGT
TACGGCATTG GTGTGTCTCG TACAGTGGCA GCAATTGCAG AGCAGTTTAA TGATGAGAAC
GGTTTAGTTT GGCCAAAAGC TGTAGCACCG TTCCATGTGC ATGTAATTCC AGTGAATATG
AAATCTGATG CACAACGTGA AATGGGTGAA AACATCTACA ACTCATTACA AGAGCAAGGT
TATGAAGTAT TACTAGATGA TCGTGCAGAA CGTGCAGGTG TTAAATTTGC TGATGCTGAT
TTATTCGGCC TTCCAGTTCG CGTGACAGTT GGTAAAAAAG CAGACGAAGG TATTGTAGAA
GTGAAAGTAC GTGCTACAGG TGAGTCTGAA GAAGTAAAAG TAGAAGAACT TCAAACATAT
ATTGCTAATA TTTTAAAATA G
 
Protein sequence
MKQSMVFSPT LREVPADAEI KSHQLLLRAG FMRQNASGIY SFLPFGLKVL HKVERIVREE 
MERAGAVELL MPAMQAAELW QESGRWYSYG SELMRMKDRN AREFALGATH EEVITDLVRD
EVKSYKKLPL TLYQIQTKFR DEQRPRFGLL RGREFLMKDA YSFHATQESL DEVYDRLYKA
YSNIFARCGL NFRAVIADSG AMGGKDTHEF MVLSDVGEDT IAYSDTSDYA ANIEMAPVVA
TYTKSDEAEK ELEKVATPDQ KAIEEVSAFL NIEADKCIKS MVFKVDEKLV VVLVRGDHEV
NDVKVKNVYG ASVVELASHE EVKELLNCEV GSLGPIGVNG DIEIIADHAV ASIVNGCSGA
NEEGFHYVNV NPERDFKVSQ YTDLRFIQEG DQSPDGNGTI LFARGIEVGH VFKLGTRYSE
AMNATFLDEN GKTQPLIMGC YGIGVSRTVA AIAEQFNDEN GLVWPKAVAP FHVHVIPVNM
KSDAQREMGE NIYNSLQEQG YEVLLDDRAE RAGVKFADAD LFGLPVRVTV GKKADEGIVE
VKVRATGESE EVKVEELQTY IANILK