Gene CPF_0910 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0910 
Symbol 
ID4202911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1073139 
End bp1074983 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content25% 
IMG OID638081792 
Productcapsular polysaccharide biosynthesis protein 
Protein accessionYP_695359 
Protein GI110799171 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.143929 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAGTG TACGTAAAAT AAAGTGTTTA TTTCTAATAT TCTTAGATAT TTTATTTATA 
AATTTAGGGT ATTTCATAAG TTTATGTTTT GAATATGGAA AAGAAATGAA AATAGAGTAT
TTTTTTAATA TTAGAAATTT AATTATATTA GCAATTTTTA CGAATATATT GATATTTTGC
TTTTTTAATC TTTATAAAAA TATTTGGCAC ATGGCAGGGA TTAGCGAGTG TATAAGGTGT
TTAATAGCGT CTTCAATAAG TTCTATATTA TTAATATTAT ATAAATTTAT TTTTAATATG
GATGTAACAA TAGTATTTTT AATTAATAGT AGCATTTTAA TTTGTATGTT TTCATTACTT
ACACGTATGT CTATAAGAAT TTTTAGGAAA ATATATTTTC CTTATAAACT TGAATCCAAC
TTAAGAAAAA ATGTTCTTAT AGTTGGAGCT GGTCACTGTG GACGTATTGT TATAGATGAG
ATGAATAAAA ATAATAAATT TAATCCTATT GGTATAGTTG ATGATGATTT AAATAAAAAA
GGAACTTTTT TAAATGGTGT GAAGGTACTT GGAAATAGAG ATGATATAGA AAAAATATGT
AAAAGAATAA AAGTAGATAT TATTTTAATT GCGATTTCAA ATTTATCATC AAATGACAAG
GATGAAATTA TAAAAAGATG TGAAAATACA AAGATAAAAG TAAAAATAAT TCCAAGTATA
TATGATTTAA TTGATGGAAA TGTTAAATTA ACTAATATTA GAGACATTGA TTTAAGAGAT
TTATTAGGAA GAGATGAAAC TAGATTAGAC AAAGAGGAGG TAAATAATTA TATAAAAGAA
AAAATTGTTA TTGTAACTGG AGGAGGAGGT TCTATTGGTT CAGAACTTTG TAGGCAGATA
GCAGTGTTTA ATCCTAAAAA GCTTATTATA TTAGATATTT ATGAAAATTA TGCTTATGAA
TTAGAAAATG AATTAAAAAG AAATTTTAAA AATTTAGATT TAGAAGTTAT TATAGCTTCT
ATAAGAGACA AATCTAGATT AAAGAAAATA TTTGATAAGT ATAAACCAGA TTTAATATTT
CATGCGGCAG CCCATAAGCA TGTACCTTTG ATGGAAAATA ATCCAGAAGA AGCTATAAAA
AATAATGTTT TAGGAACTTT AAATGTAGCA GAATGTGCTG ATGAATTTAA TTTAGAGAAG
TTTGTATTTA TATCTACTGA TAAGGCAGTT AATCCAACTA ATATAATGGG GGCAACTAAA
AGAATTGGTG AGATGATAAT ACAAGCTATG AATGAAGTTA GTAAAACAGA TTTTGTTGCA
GTAAGATTTG GAAATGTATT AGGAAGTAAT GGTTCAGTTA TTCCACTATT TATAGAACAA
ATAAAAAATG GAGGACCTGT TACTTTAACT CATAAGGATA TAACAAGATA TTTTATGTTA
ATTCCAGAAG CAGCACAACT TGTACTTCAG GCAGGAGCTT ATGCAAAAGG TGGAGAAATT
TTTGTTCTTG ATATGGGAAA ACCAGTAAAA ATTTATGATT TAACAGAAAA ATTAATAAGA
TTATCAGGGT TTGAACCGAA TAAAGATATA CAAATAAAGA TTGTGGGACT TAGACCAGGA
GAAAAGCTTT ATGAGGAACT TATTTTAAGT GAAGAAGAGC TTAAAAAGAC TAAGAATGAA
AAAATATTTA TATTAAGTCC ATTTAAGTTT GATATTAAAG AGATTAAAAA GAAAATAGTA
GAACTTTTAA ATGTAGCTTT AAATGAAGAT AAAAAAGCCA TTAAAGAGAA GCTTAAAGAA
ATTGTGAAAA ATTATAGAGA TTTAGAGCAA ATAGATTTTA TATAA
 
Protein sequence
MNSVRKIKCL FLIFLDILFI NLGYFISLCF EYGKEMKIEY FFNIRNLIIL AIFTNILIFC 
FFNLYKNIWH MAGISECIRC LIASSISSIL LILYKFIFNM DVTIVFLINS SILICMFSLL
TRMSIRIFRK IYFPYKLESN LRKNVLIVGA GHCGRIVIDE MNKNNKFNPI GIVDDDLNKK
GTFLNGVKVL GNRDDIEKIC KRIKVDIILI AISNLSSNDK DEIIKRCENT KIKVKIIPSI
YDLIDGNVKL TNIRDIDLRD LLGRDETRLD KEEVNNYIKE KIVIVTGGGG SIGSELCRQI
AVFNPKKLII LDIYENYAYE LENELKRNFK NLDLEVIIAS IRDKSRLKKI FDKYKPDLIF
HAAAHKHVPL MENNPEEAIK NNVLGTLNVA ECADEFNLEK FVFISTDKAV NPTNIMGATK
RIGEMIIQAM NEVSKTDFVA VRFGNVLGSN GSVIPLFIEQ IKNGGPVTLT HKDITRYFML
IPEAAQLVLQ AGAYAKGGEI FVLDMGKPVK IYDLTEKLIR LSGFEPNKDI QIKIVGLRPG
EKLYEELILS EEELKKTKNE KIFILSPFKF DIKEIKKKIV ELLNVALNED KKAIKEKLKE
IVKNYRDLEQ IDFI