Gene CPF_2008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2008 
SymbolengA 
ID4202478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2248427 
End bp2249743 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content31% 
IMG OID638082877 
ProductGTP-binding protein EngA 
Protein accessionYP_696441 
Protein GI110798819 
COG category[R] General function prediction only 
COG ID[COG1160] Predicted GTPases 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR03594] ribosome-associated GTPase EngA 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAC CAATAGTTGC TATGGTTGGA AGACCGAACG TAGGTAAGTC GACTCTTTTC 
AATAAATTAG CAGGAAAAAG AATTTCAATA GTACAAGATA CACCAGGGGT TACTAGAGAC
AGAGTATATG CAGAATCAGA ATGGTTAAAC AGAAAATTCA CAATGATAGA TACAGGTGGA
ATAGAGCCTG AAAGTAGTGA TATAATTGTT AAACAAATGA GAAGACAAGC ACAAATTGCT
ATAGAAATGG CTGATGTAAT AGTATTCGTT GTTGATGGTA AGGAAGGACT TACTGCTGCT
GACCAAGAAG TTGCACAAAT GCTTAGAAAA AGTAAAAAGC CTGTTGTTTT AGTAGTTAAT
AAAATAGATA GATTAGCTTT AGAAGAAAAT AGCTATGAGT TCTACAATTT AGGAATTGGA
GATCCTATAA CTATATCAGC ATCTCAAGGA TTAGGACTTG GAGATATGCT AGATGAAGTT
GTTAAATATT TTAATGATCC TTCAGAAGAT GAAGAGGATG ATGAATATAT TAGAATAGCT
ATGATAGGTA AACCAAATGT AGGTAAATCA TCACTTATAA ATAGATTATT AGGTGAAGAG
AGAGTTATAG TAAGTAATGT TCCAGGAACA ACAAGGGATT CTATAGATAG TTACTTAGAA
ACAGAAGATG GAAAATTCAT CTTAGTTGAT ACTGCTGGAT TAAGAAGAAA AAGTAAAGTA
AAAGAAGAAA TAGAAAGATA TAGTGTAATC AGAACTTATG CTGCTATAGA GAAAGCTGAT
GTAGCTATAC TTGTAATAGA TGCTGAGCAA GGAATAACTG AGCAAGATGA AAAAATAATA
GGATATGCTC ATGAAATGAA TAAAGCAATT ATGGTTGTTG TAAATAAATG GGATCTTATT
GAAAAAGATG ATAAAACATT AAGTAATTAT CAAAAAGACT TACAACAAAA ACTTAAGTTT
ATGCCATATG CTAAATACTT ATTCATATCA GCTTTAACAG GACAAAGAGT ACATAAAATA
TTATCAACAG CTAAATATTG CTATGATAAT TACTCTAAGA GAGTTTCAAC TGGATTATTA
AATGATGTTA TAAGTAAGGC TGTTTTAATG AAAGAGCCAC CAGTTGTAGC CTTAAAGAGA
TTAAAAATAT ACTATGCTAC TCAGGTTGCT ACAAAGCCAC CTAAGTTTGT GTTCTTTGTA
AATGACCCTA ATTTATTACA TTTCTCATAT GGTAGATATT TAGAAAACCA ATTAAGAGAA
AGTTTTGATT TTGATGGAAC TGGTATAGAA ATAGAATATA GAGCTAGAAA GGAGTAA
 
Protein sequence
MSKPIVAMVG RPNVGKSTLF NKLAGKRISI VQDTPGVTRD RVYAESEWLN RKFTMIDTGG 
IEPESSDIIV KQMRRQAQIA IEMADVIVFV VDGKEGLTAA DQEVAQMLRK SKKPVVLVVN
KIDRLALEEN SYEFYNLGIG DPITISASQG LGLGDMLDEV VKYFNDPSED EEDDEYIRIA
MIGKPNVGKS SLINRLLGEE RVIVSNVPGT TRDSIDSYLE TEDGKFILVD TAGLRRKSKV
KEEIERYSVI RTYAAIEKAD VAILVIDAEQ GITEQDEKII GYAHEMNKAI MVVVNKWDLI
EKDDKTLSNY QKDLQQKLKF MPYAKYLFIS ALTGQRVHKI LSTAKYCYDN YSKRVSTGLL
NDVISKAVLM KEPPVVALKR LKIYYATQVA TKPPKFVFFV NDPNLLHFSY GRYLENQLRE
SFDFDGTGIE IEYRARKE