Gene CPF_0234 details


Gene Information

Locus tag: CPF_0234
Symbol:
ID: 4203951
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 283778
End bp: 285256
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 26%
IMG OID: 638081118
Product: hypothetical protein
Protein accession: YP_694696
Protein GI: 110800030
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
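
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(283778, 285256)
print(length)                     # 1479
print(protein_length_aa(length))  # 492
```

Both results agree with the Gene Length and Protein Length fields in the record.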


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0178449
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA ATGTAGCGCT CAGTAGTTTT TTAAACATAT CATTGAAATT TTTATTTCTT 
TTGATATTAG TAGAAAGCAT TGTTCATAGA AATTTATACT ATAAATTTTC AATTACTTCA
TTAGTGCTTT CTATTATAGC AATAAGCTTA GTAATATTTG TGTATTTTTA TTTAAAGAAA
AATTATAGCA AAAAACTATT ATTTATTATT CTTTTATCCG TAGGACTAAT ATTTAGAGTC
TTATGGTTTC TAAATTTAGA TAGTATTCCT GTAGGTGATT TTAACAGGAT GTTTATATGT
GCAGGTGAAT TTCTAACTGG AAGTAACTAT ATGTTTAGAG GAACAAGTTA CTTTGCAAGA
TTTCCACATA TGACAGCAAC AGTACTTTAT TTTGCCATAA TAAGAAATTT TTTTAGTAAC
CCCTTAATAG CTATCCGAAT TATAAACATA TTACTTTCAA TGTTTAACAT AATTTTGCTT
TATATGATTT CTAAAGAAAT TTTTAAAGAT GAAAAGAAAA GTTTTTGGGT TTTATTAATA
AGTGCCCTGT ATCCACCTAT GATATTATAT AACAATGTTT ATTGTTCAGA AAACTTAGCT
ATGCCACTTT TATTGCTAAG TGTACTTATG TTCTTTAAAT CAATAAACAA TAAACAAAAT
TTATTATACT TATGTCTATC TGGAATCTTT TTAAGCTTAT CTCATTTATT TAGACCTAAT
GGATATGTTT TTATAATAGC TTACATAATG TATTTATTTC TTTATTTCAA AGAAAACATT
ACTGTTAAGC TTAAAAATAT TCTAGTAGTA TTAGTATCAT TTATAGTTCC TTTTGTTTTA
TTTAGTACAC TGCTTATTAA ATTAAATATA ACTGAATATC CTCTTTGGCA TGGTACAGAA
CCACCAAGTA TTTCTATGCT AAAAGGAACA AATATAACTT CTGGTGGAAA GTGGAATGAA
GAGGACTTTA AAGTTTTTCA TGACTGTGAT GAAAACTATG AAAAGGCTGA TAAAAAGGCT
AAAGAAATAA TAAAAGACAG ATTAATAAAT ACTCCTAAAT TAGATCTTGC TAAATTCTAT
GTTTCAAAAT TTTCAAACTT CTGGAATAAT GGAAGTTTTG CTGGTGATTA CTGGTCTGAA
GCTGGATTAG ATGAAGCTTA TAATAAGGAA GATTACCTAA AAATGCTAGG AAAAGAAAAT
GGAAATATGA CTATAAGAAT CAGTGAAGAA GGAGTATTCT ATATTCAAAG TTTCTACATT
ATACTTCTTG CATTATCATA TGTTGGATTA TATAAAAATA AGTCAAAAAG AAAGAACTTA
ATTGATCTTC TTTATATACT TTTTGGTGGT ATGTCACTAC AGTTATTACT CATAGAAGCT
CAAGACAGAT ATTCATATCC TTTATCATGG ATATTTATAA TTCTTGCCAT GACTGCTTTT
AATCCAAAAG AAAATGAGGA GGCGTTAGAT TATGATTAA
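
The gene sequence is listed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. As a rough sketch (the helper names are hypothetical), the GC fraction of such a listing could be computed as follows. The example uses only the first two blocks of the sequence; passing the complete listing would give a value to compare with the GC content field reported above.

```python
def clean_sequence(blocked):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked):
    """Fraction of G and C bases in a (possibly blocked) DNA sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


first_blocks = "ATGAAAAAAA ATGTAGCGCT"  # first two blocks of the gene sequence
print(gc_fraction(first_blocks))  # 0.3
```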
 
Protein sequence
MKKNVALSSF LNISLKFLFL LILVESIVHR NLYYKFSITS LVLSIIAISL VIFVYFYLKK 
NYSKKLLFII LLSVGLIFRV LWFLNLDSIP VGDFNRMFIC AGEFLTGSNY MFRGTSYFAR
FPHMTATVLY FAIIRNFFSN PLIAIRIINI LLSMFNIILL YMISKEIFKD EKKSFWVLLI
SALYPPMILY NNVYCSENLA MPLLLLSVLM FFKSINNKQN LLYLCLSGIF LSLSHLFRPN
GYVFIIAYIM YLFLYFKENI TVKLKNILVV LVSFIVPFVL FSTLLIKLNI TEYPLWHGTE
PPSISMLKGT NITSGGKWNE EDFKVFHDCD ENYEKADKKA KEIIKDRLIN TPKLDLAKFY
VSKFSNFWNN GSFAGDYWSE AGLDEAYNKE DYLKMLGKEN GNMTIRISEE GVFYIQSFYI
ILLALSYVGL YKNKSKRKNL IDLLYILFGG MSLQLLLIEA QDRYSYPLSW IFIILAMTAF
NPKENEEALD YD
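
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the set of permitted start codons; this gene starts with ATG, so a plain codon lookup is enough here. The sketch below is an illustration with hypothetical names, not the method used by the database, and it stops at the first stop codon.

```python
from itertools import product

# Codons in T, C, A, G order, paired with the standard amino acid assignments
# (shared by translation table 11 for all non-start positions).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna):
    """Translate a blocked DNA listing, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last nine bases of the gene sequence listed above.
print(translate("ATGAAAAAAA ATGTAGCGCT CAGTAGTTTT"))  # MKKNVALSSF
print(translate("TATGATTAA"))                         # YD
```

The two outputs match the first ten and last two residues of the protein sequence listing.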