Gene CPF_0473 details

Gene Information

Locus tag: CPF_0473
Symbol:
ID: 4202977
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 561106
End bp: 562176
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 20%
IMG OID: 638081355
Product: glycosyl transferase, group 1
Protein accession: YP_694928
Protein GI: 110800200
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
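
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(561106, 562176)
print(length, protein_length_aa(length))  # prints "1071 356"
```

Both values agree with the Gene Length and Protein Length fields in the record.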


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000388083
Plasmid hitchhiking: No
Plasmid clonability: unclonable


Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a


Sequence

Gene sequence
ATGAAAATTA CTATTATAGG ACCATATCCA CCACCAATAG GGGGTATATC TATACATATT 
AAAAGATTAA AAAGATTCTT ATACAATAAT GGATTAAAAG TTAAAGTTTT AAATGAGGAA
ACATATATTA ATATATCTGA AAATATAATT CCTTTAAATG GATATAAAAA TTTATTTAAA
TTATTAAGAA AAGATAATAG TGAATTAATT CATTTTCATA CAATAAATAA GTATATAAGA
TCCTTATTAT GGTTAATTAA GATAATTTTT AAGAAAAAAA TTGTTTTAAC TATTCATGGA
CAAAGTATGG TTAATCAATA CAAAAATTCA AATATTTTCA TAAGAAAAAT GTTAATAAAA
ACATTAAATG ATATAGATGC AACAATATTT GTTGATAAAA ATAATTTGAA TTTCTTTGAT
AAAATTGTAA AAAATAGAGA TAAACTTAAA TATATAAATC CATTTATTTT TCCATGTATT
TCTGAGAATA ATATTGAAAG ACCAAATTTA AAAAGTTTTT TTAATAACAA TGATTTTAAA
ATACTAATGA GTGGGAATAT AAGGTTTTTT GAAGGTAAGG AATTATATGG ATTTAAATCT
ATGATTAATG CAACTAAAAA GCTTAAAGAA AAAGGTGAAA ATATAAAAGT GCTAATGATT
ATTATGGAAT CAAATAATCA ATCTATTGAG GAAAAAGAAT ATTATAAATT TATAAAAAAA
CAAGTTTTAG ATAATGAACT ACAAAATTAT ATATTTTTTT ATGAACCAAT AAATGAAGAA
ATATTTGATT TATTTGGAAA AGTTAATTTA TTTGTCAGAC CAACTATTGT AGATGGATTT
GGTATTTCTT TAGCAGAATC AATATATATG AAAACACCTG CTATTGCCAC TGATGTTTGT
ATTAGACCAA ATGGAACTAT TTTATATCAT GATAATGAAG AATTAGCTAA TATTATATAT
AATGTAAAGA ATAATTATGA TTATTATAAA AATAGTTTAG ATAATATAAA AATACAAGAT
AGTTCTAAAG ATATTTTAGA ATTATACAAT AATTTACTTA AGAAGGTGTA G
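
A GC percentage like the one in the Gene Information fields can be computed directly from a nucleotide sequence. The sketch below assumes GC content is simply the share of G and C bases among all bases, ignoring the spaces and line breaks used for display. The record does not state how its own figure was computed or rounded, so this illustrates the general calculation rather than the database's method. The function name `gc_content` is hypothetical.

```python
def gc_content(dna):
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base grouping and line breaks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example on the first displayed line of the gene sequence.
first_line = "ATGAAAATTA CTATTATAGG ACCATATCCA CCACCAATAG GGGGTATATC TATACATATT"
print(round(gc_content(first_line), 1))
```

Passing the full gene sequence instead of a single line gives the value for the whole gene.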
 
Protein sequence
MKITIIGPYP PPIGGISIHI KRLKRFLYNN GLKVKVLNEE TYINISENII PLNGYKNLFK 
LLRKDNSELI HFHTINKYIR SLLWLIKIIF KKKIVLTIHG QSMVNQYKNS NIFIRKMLIK
TLNDIDATIF VDKNNLNFFD KIVKNRDKLK YINPFIFPCI SENNIERPNL KSFFNNNDFK
ILMSGNIRFF EGKELYGFKS MINATKKLKE KGENIKVLMI IMESNNQSIE EKEYYKFIKK
QVLDNELQNY IFFYEPINEE IFDLFGKVNL FVRPTIVDGF GISLAESIYM KTPAIATDVC
IRPNGTILYH DNEELANIIY NVKNNYDYYK NSLDNIKIQD SSKDILELYN NLLKKV
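
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below builds the codon map from the compact NCBI-style amino acid string for the standard code. Translation table 11 (bacterial, archaeal and plant plastid) assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons. This gene starts with ATG, so that difference does not matter here. The name `translate` is hypothetical, and the sketch assumes the input contains only the bases A, C, G and T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order, third base varying fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string up to (not including) the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAAAATTA CTATTATAGG ACCATATCCA"))  # prints "MKITIIGPYP"
```

The output matches the first block of the protein sequence above.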