Gene CPF_2033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2033 
SymbolalaS 
ID4201044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2272989 
End bp2275628 
Gene Length2640 bp 
Protein Length879 aa 
Translation table11 
GC content32% 
IMG OID638082902 
Productalanyl-tRNA synthetase 
Protein accessionYP_696466 
Protein GI110799076 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0013] Alanyl-tRNA synthetase 
TIGRFAM ID[TIGR00344] alanine--tRNA ligase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTCA TGGGAGCAAA TGAATTAAGA GAAAAATATT TAAGCTTTTT TGAAAGCAAA 
GATCATTTAA GATTACAGTC ATTTCCGTTA GTACCTAAAA ATGATAAGAG TTTATTATTA
ATAAATGCAG GTATGGCACC ACTTAAACCT TATTTCACAG GTTTAGAAGA ACCACCAAAA
AGAAGAATAA CAACTTGCCA AAAGTGTATA AGAACTGGTG ATATAGAGAA TGTTGGTAAA
ACATCAAGAC ATGGTACATT CTTCGAAATG CTAGGAAACT TTTCATTTGG AGATTACTTC
AAATCAGAAA TAATTCCTTG GGCTTGGGAG TTTATAACAG AAACTTTAGG AATTCCAAAG
GATAAATTAT ATGTAACTAT ATATTTAAAT GACGATGAAG CTTATGATAT TTGGACTAGT
AAAACTGATG TAGATCCAAG CAGAATATTC AGATTAGGAA AAGATGATAA CTTCTGGGAA
ATAGGGGTAG GTCCTTGTGG TCCTTGTACA GAGATTCACT TTGATAGAGG AGAAGGAAAG
GTTGAAACTG TAGAAGAATT CTTAGAAGCT TCAGATGCTG ATAGAATAGT TGAGTTCTGG
AACTTAGTTT TCACTCAATT TGATAAAGAT GAAGAAGGAA ACTACAATGA GTTAGCTCAA
AAGAACATAG ATACAGGTAT GGGCTTAGAA AGAATAGCTA CAATAATGCA AGGTGTAGAT
AATATCTTCG AAATAGATAC AGTTAAAAAC ATATTAAATA AAGCATGTGA ATTAACAAAT
GCTAAATATG GAGAAGATAA AGACAAAGAT GTATCATTAA GAATAATAAC TGACCATGGA
AAAAGTGTTA CTTTCTTAAT ATGTGACGGT GTTCAACCAT CAAATGAAGG TAGAGGATAT
GTTTTAAGAA GACTTCTTAG AAGAGCTGCT AGACATGGAA GACTTTTAGG AGTTAAAGGT
ATATTCTTAA ATGAAATGGT TGATGCTGTA GTTGAAAATT ACGGTGAAGC TTATCCAGAA
TTAAAAGAAA AAGCTGATTA CATTAAGAAA ATAATTAAAT TAGAAGAAGA AAGATTTAAT
GAAACAATAG ACTCAGGTAT GGATATACTT ATGAGCTATA TTTCAGAAAT GGAAGAAAAA
AATGAGAAAG TTTTATCAGG AGCTAAAGCT TTCAAATTAT ATGATACTTA TGGATTCCCT
CTAGAGCTTA CTCAAGAGAT ATTAGAAGAA AAAGGATTAG AGTTAGATAT AGAAAACTTT
AATAAGGAAA TGAAAGAGCA AAGAGAAAGA GCTAGAAATG CTAGAGGAGA AAGTAGCTAC
ATGGGAAGCG AAGAAAGTCC AGTAAACAAA GTGGATGCTT CAATAGTTAC TGAATTTGAT
GGATATGTTA ATTTAGAACT TAACTCAAAA GTTATAGTAC TAGGAAATAA TGAAGAATTT
AAATCTGAAC TTAAAGAAGG TGAAGAAGGA TTCTTATTAA CTGATAAAAC TCCTTTCTAT
GCTGAAATGG GAGGTCAAGT TGGAGATAGA GGAAATATAA CTTCAGAAAC TGGAATGGCA
ATAGTTACAG ATTGTAAGAA AAATGTTGGT GGAAAATTTG TTCACTACAT TAAGGTTATA
GAAGGTAGCT TAAAAGAAGG TCAAGAAGTT AAATTATCAG TTGATGCTTC AAGAAGATCT
AACATATGTA AAAACCACAC AGCTACACAC ATGTTACATG AAGCTTTAAA AGAAGTTTTA
GGAGACCATG TAAATCAATC AGGTTCATAT GTTGATGAAG AAAGATTAAG ATTTGACTTT
ACTCATTTTG CAGCTTTAAC AGAAGAAGAA TTAGAAAAAG TTGAATTATT AGTAAATGAA
AAAATAATGA CTGTATCTGT AGTTGATACA AAGGAAATGT CATTAGATGA AGCTAGAAAT
AGTGGAGCAA CTTGTCTTTT CGATGAAAAG TATGCTGAAA AAGTAAGAGT TGTTTCAGTT
GGAGATTTCT CAAAAGAATT ATGTGGAGGA ACTCACGTAG CTAACTCAGG AGAAATTGGA
TTATTTAAGA TAGTTTCAGA ATCAGGAGTT GCTGCTGGAA TAAGAAGAAT TGAGGCTGTT
ACTGGAATAA GCGCATTAAA ATTCATGGAA CTTAAAAATA ATATGCTTAA GGAAGCTGCT
TCAATGCTTA AGTGTAATGA AAAAGATATT GCTAAGAGAA TAGCAGCTCA AGCTCATGAA
TTAAAAGAAA AAGATAAAGA AATAGCAGAA CTTAAAGCTA AATTAGTTCA AGGTGCTGAG
GATGATATCT TAAAAGACAA AGTTGAAATA AACGGAGTTG AATTAGTTAC AGCAGAATTA
AAAGATGTTG ATGGAAATTC ATTAAGAGAT TTAGCTGATA AAGTTAGAAA TAAATTAAAT
AATGGAATAG TTGTTTTAGC AAGTGACAAT GGTGGAAAAG TAAACTTAGT AGCTATGGCA
ACTAAAAATT CATTAGCTAA TGGAGTTCAT TGTGGAAAGG TAATAAAAGA AGTTGCAGCA
GTTGTAGGCG GCGGCGGAGG TGGAAGACCT GACATGGCTC AAGCTGGTGG AAAAAATCCA
GAAAATATAG CTAAAGCATT AGAAAAAGCA AAAGAAGTTG TAGAATTACT TGTAAAATAG
 
Protein sequence
MKFMGANELR EKYLSFFESK DHLRLQSFPL VPKNDKSLLL INAGMAPLKP YFTGLEEPPK 
RRITTCQKCI RTGDIENVGK TSRHGTFFEM LGNFSFGDYF KSEIIPWAWE FITETLGIPK
DKLYVTIYLN DDEAYDIWTS KTDVDPSRIF RLGKDDNFWE IGVGPCGPCT EIHFDRGEGK
VETVEEFLEA SDADRIVEFW NLVFTQFDKD EEGNYNELAQ KNIDTGMGLE RIATIMQGVD
NIFEIDTVKN ILNKACELTN AKYGEDKDKD VSLRIITDHG KSVTFLICDG VQPSNEGRGY
VLRRLLRRAA RHGRLLGVKG IFLNEMVDAV VENYGEAYPE LKEKADYIKK IIKLEEERFN
ETIDSGMDIL MSYISEMEEK NEKVLSGAKA FKLYDTYGFP LELTQEILEE KGLELDIENF
NKEMKEQRER ARNARGESSY MGSEESPVNK VDASIVTEFD GYVNLELNSK VIVLGNNEEF
KSELKEGEEG FLLTDKTPFY AEMGGQVGDR GNITSETGMA IVTDCKKNVG GKFVHYIKVI
EGSLKEGQEV KLSVDASRRS NICKNHTATH MLHEALKEVL GDHVNQSGSY VDEERLRFDF
THFAALTEEE LEKVELLVNE KIMTVSVVDT KEMSLDEARN SGATCLFDEK YAEKVRVVSV
GDFSKELCGG THVANSGEIG LFKIVSESGV AAGIRRIEAV TGISALKFME LKNNMLKEAA
SMLKCNEKDI AKRIAAQAHE LKEKDKEIAE LKAKLVQGAE DDILKDKVEI NGVELVTAEL
KDVDGNSLRD LADKVRNKLN NGIVVLASDN GGKVNLVAMA TKNSLANGVH CGKVIKEVAA
VVGGGGGGRP DMAQAGGKNP ENIAKALEKA KEVVELLVK