Gene CPF_2121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2121 
SymbolychF 
ID4203404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2351900 
End bp2352997 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content32% 
IMG OID638082986 
ProductGTP-dependent nucleic acid-binding protein EngD 
Protein accessionYP_696549 
Protein GI110799604 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0012] Predicted GTPase, probable translation factor 
TIGRFAM ID[TIGR00092] GTP-binding protein YchF 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTAG GAATGGTTGG ATTACCAAAT GTTGGTAAAA GTACATTATT CAATGCTATT 
ACAAAAGCAG GAGCTGAATC AGCAAACTAT CCTTTCTGTA CAATAGAACC AAACGTAGGT
GTTGTAAGTG TACCAGATAA GAGATTAGAT GTTTTAGAAA AAATATATAA TACAAAGAAA
AAAGTATATA CTGCAATAGA GTTTTATGAT ATAGCAGGAT TAGTTAAAGG TGCTAGTAAA
GGTGAAGGAT TAGGAAATAA ATTCTTATCA CACATAAGAG AAGTTGCAGC TATCGTTCAT
GTTGTAAGAT GCTTTGACGA TGAGAATGTT GTTCACGTAG AAGGTTCTGT AGATCCAATA
AGAGATATAG AAACTATAAA CTTAGAACTT ATCTTTGCAG ACTTAGATGT TCTTGAAAGA
AGAATGGAAA AGACTATGAA GTTAGTAAGA TCAGGTGATA AAACTGCTAA GTTTGAGTAT
GATGTAATGG AAAGATTAAA AGCTCACTTA GAAGCAAATA AACCAGCTAG AACTTTAGAA
GCTACTGAAG ATGAAGAAGC TTTCGTAAAA AGCTTATTCT TAATAACTTC AAAACCAGTT
TTATATGCTT GTAACATATC AGAAGATGAT ATGATGGAAG GAAACTTAGA TAATGAATAT
GTTCAAAAAG TTAGAGCATA TGCTGAAACT GAGAATTCAG GAATCATGGT TGTTTGCGCT
AAACTTGAAG AGGAATTATC AGGATTAGAT GAAGAAGAAA AAGCTGAAAT GTTATCTGAG
TATGGATTAG AAGAATCAGG TCTTGATAAA CTTATACAAG CAAGTTACAA ATTATTAGGA
TTAATGAGTT ACTTAACTGC AGGGGTACAA GAAGTTAGAG CTTGGACAAT AAAACAAGGA
ACTAAAGCTC CACAAGCAGC AGGTAAAATT CACTCTGATA TAGAAAGAGG ATTCATAAGA
GCTGAGGTTG TTTCTTATGA TGATTTAGTA GAATGTGGTT CAGAAGCAGC TGCTAAAGAA
AAAGGTGTTT ACAGATTAGA AGGTAAAGAA TACGTAATGA AAGATGGAGA TATAGTTAAC
TTCAGATTCA ACGTATAA
 
Protein sequence
MKLGMVGLPN VGKSTLFNAI TKAGAESANY PFCTIEPNVG VVSVPDKRLD VLEKIYNTKK 
KVYTAIEFYD IAGLVKGASK GEGLGNKFLS HIREVAAIVH VVRCFDDENV VHVEGSVDPI
RDIETINLEL IFADLDVLER RMEKTMKLVR SGDKTAKFEY DVMERLKAHL EANKPARTLE
ATEDEEAFVK SLFLITSKPV LYACNISEDD MMEGNLDNEY VQKVRAYAET ENSGIMVVCA
KLEEELSGLD EEEKAEMLSE YGLEESGLDK LIQASYKLLG LMSYLTAGVQ EVRAWTIKQG
TKAPQAAGKI HSDIERGFIR AEVVSYDDLV ECGSEAAAKE KGVYRLEGKE YVMKDGDIVN
FRFNV