Gene BCAH820_3831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_3831 
SymbolproS2 
ID7188922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp3668702 
End bp3670402 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content39% 
IMG OID643557242 
Productprolyl-tRNA synthetase 
Protein accessionYP_002452781 
Protein GI218904947 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0442] Prolyl-tRNA synthetase 
TIGRFAM ID[TIGR00409] prolyl-tRNA synthetase, family II 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value5.08087e-60 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAAACAAA GTATGGTATT CAGTCCTACA TTACGTGAAG TTCCAGCTGA TGCGGAGATT 
AAGAGTCATC AGTTATTACT TCGTGCAGGT TTTATGCGTC AAAATGCTTC TGGTATTTAT
AGTTTTCTAC CATTTGGATT AAAAGTACTA CACAAAGTAG AACGTATCGT TCGAGAAGAG
ATGGAGCGCG CAGGTGCCGT AGAATTATTA ATGCCAGCGA TGCAAGCTGC AGAATTATGG
CAAGAGTCAG GTCGTTGGTA TTCTTACGGA TCTGAATTAA TGCGTATGAA AGATCGTAAC
GCTCGTGAAT TTGCATTAGG AGCGACACAT GAAGAAGTAA TTACAGATCT TGTACGTGAT
GAAGTGAAAT CGTATAAAAA ATTACCGTTA ACATTATATC AAATTCAAAC AAAATTCCGT
GATGAACAAA GACCTCGTTT CGGTTTATTA CGTGGAAGAG AGTTTCTAAT GAAAGATGCA
TACTCTTTCC ATGCTACGCA AGAGAGCTTA GATGAAGTGT ACGATCGCTT ATACAAAGCA
TACTCTAACA TCTTTGCTCG TTGTGGCTTG AATTTCCGTG CGGTTATTGC TGATTCTGGA
GCAATGGGTG GAAAAGATAC ACATGAATTT ATGGTATTAT CTGATGTTGG TGAAGATACA
ATTGCATACT CTGATACATC TGATTACGCA GCGAACATCG AAATGGCTCC TGTTGTAGCT
ACGTATACGA AGAGTGACGA AGCAGAAAAA GAGCTTGAAA AAGTAGCAAC ACCAGACCAA
AAAGCAATTG AAGAAGTATC TGCATTCTTA AACATCGAAG CTGACAAGTG CATTAAGTCT
ATGGTATTTA AAGTAGATGA GAAATTAGTA GTGGTACTTG TTCGTGGTGA TCATGAAGTA
AACGATGTAA AAGTGAAAAA TGTATACGGT GCTTCAGTTG TTGAGCTTGC CTCTCATGAA
GAAGTAAAAG AATTATTAAA TTGTGAAGTT GGTTCATTAG GACCGATTGG TGTAAATGGT
GATATCGAAA TTATCGCTGA TCACGCTGTA GCATCAATTG TCAACGGATG TTCAGGAGCG
AACGAAGAAG GATTCCATTA TGTAAATGTA AATCCAGAAC GTGACTTTAA AGTAAGTCAA
TATACGGATT TACGCTTCAT TCAAGAAGGA GACCAATCTC CAGACGGAAA CGGGACAATT
CTTTTCGCAC GCGGAATTGA AGTTGGTCAT GTATTCAAAT TAGGAACTCG TTATAGTGAA
GCAATGAACG CAACATTCCT AGATGAAAAC GGAAAAACAC AACCACTTAT TATGGGTTGT
TACGGCATTG GTGTGTCTCG CACAGTGGCA GCAATTGCAG AGCAGTTTAA TGATGAGAAC
GGTTTAGTTT GGCCAAAAGC TGTAGCACCG TTCCATGTGC ATGTAATTCC AGTGAATATG
AAATCTGATG CACAACGTGA AATGGGTGAA AACATCTACA ACTCATTACA AGAGCAAGGT
TATGAAGTAT TACTAGATGA TCGTGCAGAA CGTGCAGGTG TTAAATTTGC TGATGCTGAT
TTATTCGGCC TTCCAGTTCG CGTGACAGTT GGTAAAAAAG CAGACGAAGG TATTGTAGAA
GTGAAAGTAC GTGCTACAGG TGAGTCTGAA GAAGTAAAAG TAGAAGAACT TCAAACATAT
ATTGCTAATA TTTTAAAATA G
 
Protein sequence
MKQSMVFSPT LREVPADAEI KSHQLLLRAG FMRQNASGIY SFLPFGLKVL HKVERIVREE 
MERAGAVELL MPAMQAAELW QESGRWYSYG SELMRMKDRN AREFALGATH EEVITDLVRD
EVKSYKKLPL TLYQIQTKFR DEQRPRFGLL RGREFLMKDA YSFHATQESL DEVYDRLYKA
YSNIFARCGL NFRAVIADSG AMGGKDTHEF MVLSDVGEDT IAYSDTSDYA ANIEMAPVVA
TYTKSDEAEK ELEKVATPDQ KAIEEVSAFL NIEADKCIKS MVFKVDEKLV VVLVRGDHEV
NDVKVKNVYG ASVVELASHE EVKELLNCEV GSLGPIGVNG DIEIIADHAV ASIVNGCSGA
NEEGFHYVNV NPERDFKVSQ YTDLRFIQEG DQSPDGNGTI LFARGIEVGH VFKLGTRYSE
AMNATFLDEN GKTQPLIMGC YGIGVSRTVA AIAEQFNDEN GLVWPKAVAP FHVHVIPVNM
KSDAQREMGE NIYNSLQEQG YEVLLDDRAE RAGVKFADAD LFGLPVRVTV GKKADEGIVE
VKVRATGESE EVKVEELQTY IANILK