Gene CPF_2781 details

Gene Information

Locus tag: CPF_2781
Symbol: lysS
ID: 4202360
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 3043495
End bp: 3045000
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 32%
IMG OID: 638083649
Product: lysyl-tRNA synthetase
Protein accession: YP_697153
Protein GI: 110800440
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1190] Lysyl-tRNA synthetase (class II)
TIGRFAM ID: [TIGR00499] lysyl-tRNA synthetase, eukaryotic and non-spirochete bacterial
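
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(3043495, 3045000)
print(length)                  # 1506
print(protein_length(length))  # 501
```

Both values agree with the Gene Length and Protein Length fields of this record.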


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAAATG AGGAAATAAA TATTCACGAA GCAGAAGAAC AATTAAGTGA ACAAGAGATG 
TTAAGAAGAC AAAAGCTTGC TGAATTACAA GAAGCTGGTA AGGACCCATT TGATGTATAC
AAGGTAGAGA GAACACACTC TTCTGCTGAT GTAAAAGATA ATTTTGAAGA GTTAGAAGGT
AAAGAAGTAA AAGTTGCTGG AAGACTTATG TCAAAAAGAG GTCAAGGTAA AGTAGTATTC
GCTGATTTAG CAGATTTACC AGGAAAAATT CAATTATTTA TTAAGATTGA TAACGTTGGA
GAGGAAGCTT TAAAGGAATT TAAAACTTTC GATTTAGGTG ACTGGGTTGC TGCAACTGGA
GAAGTATTTA AAACAAAAAT GGGTGAAGTT TCAGTAAAAG TAACTTCATT TGAATTAATC
TGTAAATCTT TAAAACCATT ACCAGAAAAA TGGCATGGTT TAAAAGACCC AGACTTAAGA
TACAGACAAA GAGAAGTTGA TATAATAACA AACCCAGAAG TTAAAGATAC TTTTATTAAG
AGATCACAAA TAGTAAAAGC AATAAGAGAA TTCTTAGATA ATAGAGGATT CTTAGAGGTT
GACACACCAA TTCTTTCACC AATAGCTGGT GGAGCTGCTG CTAGACCTTT CATAACTCAC
CACAATGCTT TAGATATAGA TATGTATTTA AGAATTGCTA CTGAGTTATA CTTAAAGAGA
TTAATAGTTG CTGGATTTGA AAAAGTTTAT GAAATGGGTA AAAACTTCAG AAATGAAGGG
GTTTCAGTAA GACATAATCC AGAATTCACT GCTATAGAGT TATATGAAGC ATATGCTGAT
TACAATGACA TGATGGAAAT CATGGAAAAC ATGATTGCTT ATGTTTGTGA AAAAGTAAAT
GGATCAACTA AAGTTACTTA TGAAGGAACT GAAATAGACT TCACACCACC ATGGAGAAGA
ATTACTATGG TTGATGCAGT TAAAGAATTT GCTGGTATAG ATTTCAACGA AATTAAGAGT
GATGAAGAAG CTCAAGCTAT AGCTAAAGAG AAAAACTTAG AATTCCCTAA ACCATTAGAT
AAGGTTACTA AAGGTGAAGT TCTAAATATG TTATTTGAAG AATATGGTGA AGATAAATTA
ATCCAACCTA CATTCTTAAT AGATTATCCA GTAGAGATAT CACCTCTTAC TAAAAAGAAA
AGAGGAAATG AAATGTTTAC TGAGAGATTT GAAGGATTTG TATATGGTAG AGAAGTATGT
AACGCATACT CAGAGCTTAA CGATCCAATA GTTCAAAGAG AAAGATTTGA ACAACAAGCT
AGAGAAAGAG AATACGGAGA TGATGAAGCA TACATGTTAG ATGAAGAATT CATGAGTGCT
TTAGAAACTG GTATGCCTCC AACAGGTGGA TTAGGAATAG GAATAGACAG AATGATAATG
TTCTTAACTG ACTCTTCATC AATAAGAGAC GTTATATTAT TCCCAACAAT GAAACCACAA
AAATAG
 
Protein sequence
MSNEEINIHE AEEQLSEQEM LRRQKLAELQ EAGKDPFDVY KVERTHSSAD VKDNFEELEG 
KEVKVAGRLM SKRGQGKVVF ADLADLPGKI QLFIKIDNVG EEALKEFKTF DLGDWVAATG
EVFKTKMGEV SVKVTSFELI CKSLKPLPEK WHGLKDPDLR YRQREVDIIT NPEVKDTFIK
RSQIVKAIRE FLDNRGFLEV DTPILSPIAG GAAARPFITH HNALDIDMYL RIATELYLKR
LIVAGFEKVY EMGKNFRNEG VSVRHNPEFT AIELYEAYAD YNDMMEIMEN MIAYVCEKVN
GSTKVTYEGT EIDFTPPWRR ITMVDAVKEF AGIDFNEIKS DEEAQAIAKE KNLEFPKPLD
KVTKGEVLNM LFEEYGEDKL IQPTFLIDYP VEISPLTKKK RGNEMFTERF EGFVYGREVC
NAYSELNDPI VQRERFEQQA REREYGDDEA YMLDEEFMSA LETGMPPTGG LGIGIDRMIM
FLTDSSSIRD VILFPTMKPQ K
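
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It builds the codon-to-amino-acid mapping shared by the standard code and translation table 11 (table 11 differs only in its permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first line of the gene sequence is used, to keep the example short.

```python
from itertools import product

# Codon table in TCAG order; the amino-acid assignments are the same for
# the standard code and bacterial translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence (60 nt, i.e. 20 codons).
first_line = "ATGTCAAATG AGGAAATAAA TATTCACGAA GCAGAAGAAC AATTAAGTGA ACAAGAGATG"
print(translate(first_line))  # MSNEEINIHEAEEQLSEQEM
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same kind of check against the Protein Length and GC content fields.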