Gene CPF_0478 details


Gene Information

Locus tag: CPF_0478
Symbol:
ID: 4202380
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 567223
End bp: 568551
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 21%
IMG OID: 638081360
Product: glycosyl transferase, group 1
Protein accession: YP_694933
Protein GI: 110799188
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
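
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(567223, 568551)   # 1329
length_aa = protein_length(length_bp)     # 442
```

With the values from this record, the sketch reproduces the listed Gene Length (1329 bp) and Protein Length (442 aa).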


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.038549
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATAT TACATTATAA TTTAGGGTTA CCGCCATATA GGAGCGGAGG ATTAACAAAA 
TATAGTACTG ATTTAATGAT TGAGCAATCA AAAAATAAGG AAATTTATTT GTTGTTTCCT
GGAAGATATA CTATAAATAA TAAATTTAAA ATAAAAAAAT TTAAAAAATT TAGAAATATT
AAAGTTTTTG AGATTTTAAA TCCTAAACCA GTTCCATTAA TGGGTGGTGT AGAGAATATT
GATGAATTTA TAAAACCTAT AGAAAATTGT AGGAAAGAAT CTTTAAAATT TTTAAATACT
ATAAATCCAG ATATTATACA TATTCATACA CTTATGGGTT TACCAATTGA ATTTATAAGA
GCAGCGAAAG AATTAAAAAT AAAAATTATA TTTACTACTC ATGATTATTA TGGATTATGT
CCTAAAGTTA ATTTTTTAAA ATGTAATGGT TGCTTAAATA TGTGCTATGA ATGTAATAAA
AATTCATATA GTATAACAAA AATTAAGATA ATGCAATCAG GTGTTTATAG AACATTTAAA
GAATCATCAA TAATTAAGTA TATTAGAAAA AATGTAAAGA AAAATGATTT AAAAAAAGAA
AGCAATAAAG TATTAAATGG AGATTCTGTG TTAGAAAAAA ATTCGGAATA TGAAAAATTA
AGAGGGTATT ATATAGAAAT TTTAAATCTA TTTGATAAAT TACATTTTAA TAGTAGTATA
ACAAAAGATG TTTATAATAA GTTTTGTAAA ACAAATGGTG ATATTATTTC CATAACACAT
TCAAGTATTA AAGATAATAG ATTAATAAAA GATTTTAAGA ATAAAAAATT AAGAATACTT
TTTTTAGGGT CATTAGATGA ATATAAAGGT TGTTTATATC TTATAAATGT TTTAAAAGAG
ATAAATAGTT CTTTATGGGA ATTAAATATA TATGGAAATG ATTACCAAAT TGATTTTGGA
AATGAAAATA TTCATTTAAA TGGAAGGTAT ATGCAAAAAG AACTATCTAG TATCATGAAA
AATAATGATA TATTAATTGT ACCAAGTTTA TGGAAAGAAA CTTTTGGATT TACTTTACTC
GAAGGCCTCA GTAATGCAAT TCCTATAATT GCAACTGATA CTGTTGGTTC TAAAGATTTA
ATTAAAAATA ATAAAACTGG AATTATAATT AAAAAAGATT TAATGAAAGA AACTATAAAG
AATTTAATTT ATAATAGAGA GATTCTTGAA GAGATAAATA AAAATATAAT AAAAGATAAA
TATGTTTTTG AAATGAAAAA TCATGAAAAA ATCATAGAGG AATATTATAA AAACATTATA
AAGGAATAG
 
Protein sequence
MKILHYNLGL PPYRSGGLTK YSTDLMIEQS KNKEIYLLFP GRYTINNKFK IKKFKKFRNI 
KVFEILNPKP VPLMGGVENI DEFIKPIENC RKESLKFLNT INPDIIHIHT LMGLPIEFIR
AAKELKIKII FTTHDYYGLC PKVNFLKCNG CLNMCYECNK NSYSITKIKI MQSGVYRTFK
ESSIIKYIRK NVKKNDLKKE SNKVLNGDSV LEKNSEYEKL RGYYIEILNL FDKLHFNSSI
TKDVYNKFCK TNGDIISITH SSIKDNRLIK DFKNKKLRIL FLGSLDEYKG CLYLINVLKE
INSSLWELNI YGNDYQIDFG NENIHLNGRY MQKELSSIMK NNDILIVPSL WKETFGFTLL
EGLSNAIPII ATDTVGSKDL IKNNKTGIII KKDLMKETIK NLIYNREILE EINKNIIKDK
YVFEMKNHEK IIEEYYKNII KE
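
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates this on the first 60 and last 9 bases of the gene sequence only. It builds the codon table from the standard NCBI base ordering (T, C, A, G); for table 11 the amino acid assignments match the standard code, and the difference lies in which codons may act as start codons, which does not matter here because the gene begins with ATG. The `translate` helper is a hypothetical name written for this illustration, not a library function.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon.

    Whitespace is removed first, so sequences formatted in blocks of 10
    bases (as in the record above) can be pasted in directly.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and its final nine bases, copied from above.
first_60 = "ATGAAAATAT TACATTATAA TTTAGGGTTA CCGCCATATA GGAGCGGAGG ATTAACAAAA"
last_9 = "AAGGAATAG"

n_terminus = translate(first_60)  # "MKILHYNLGLPPYRSGGLTK"
c_terminus = translate(last_9)    # "KE"
```

The two results agree with the first 20 residues and the last 2 residues of the protein sequence listed above.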