Gene CPF_0774 details

Gene Information

Locus tag: CPF_0774
Symbol:
ID: 4202500
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 921885
End bp: 924671
Gene Length: 2787 bp
Protein Length: 928 aa
Translation table: 11
GC content: 27%
IMG OID: 638081658
Product: von Willebrand factor type A/Cna B-type domain-containing protein
Protein accession: YP_695225
Protein GI: 110799635
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4932] Predicted outer membrane protein
TIGRFAM ID: [TIGR01167] LPXTG-motif cell wall anchor domain
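
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(921885, 924671)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2787 928
```

Both values agree with the Gene Length and Protein Length fields of this record.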


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0109767
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAGTA AGGGTATAGA TACCAAAAAA TTAATGTCAA TTATAAGCTT AATAATGACA 
GTTATATTTT TAAGTATTTT ATTGCCTACT AATTTAACAA AAGCAGAAGA AAAGTCTGAT
AGCATGTCTG TTGAAAAAGT ATTAAATTCA GAAATGTATA ATAATTTCAA TAATGAAATA
TTAAACAATA AAAGTAGTAT TAATTACACT ACTGATAATA ATCAAGGTAC ATGGCCAGTA
AATTGGGAGT ATGGAAATGT AAGCAATAAA AATAAAATAA ACAAAAGTGT ATATGGCAAG
AATCAACCAT TAGAATATAA TGAAGGATAT TTAACAAAGA AAGCTTATAC AACAGATGAA
GATAATGTAT TTGACATTAA TCTAAAGATT CAGGGGAAAA AAAATCAATC TTTAAAAAAA
GATGTGGTTT TCCTTTTAGA CAATTCAAAT TCTATGACAA CAAATAATCG TGCAATAAAG
ATTAAAGAAC AAATTAAAAA TGTTATGGAT AAGCTAAATA CTAATAATAC ACGTTATGCG
CTTGTTACTT ATGCCTCAAC AATTTTAGAT GGAAGGTATT ATCATTTAAT TGATAGATCT
ATAGGGGATA ATAAATATAC AGTTTATAAA GGTTATACAA GTAACCAGTG TTATCTAAAT
TTTACTAGTA ATATTCAAGA GATTTATAAT AAAATACCTA CTACTGTTCC AAATCAGAGA
AATAATGGTT ATGTAGGGGG AACATTTACT CAAGAAGGAT TATTGAAAGC AATAGAACTT
TTAAAAAATA GTGATGCTGA TGAAAAAATT ATTATTCATC TTACTGATGG GTTACCAACA
TTTTCTTTTC TTTTAAAAGA GTTTGGAGGA AATGAAAAAG CTATTTTTGA CTATAACACT
CAATATAATG GTATTGGTGT ACGTGGATTT GGGACATCAT ACTTTTTTAA TACTAAAACT
CAAAAGCCGT ATATATATTC TAGAGAAGAA GTATATTCTG CTTTAAATCG TTCAATAAAT
AAAAATGAAT CAATATGGAA TAATGGTTTT CCAACTACTT TAGAAGCAGA GAATATTAAA
AAGGAAAATC CGGACATTAA TATTTACACT ATTGGGATAG AACTTAAAAA AGAAGTATAT
AAATGGGATG ATTATAGAAA ATATTATAAT GCTGAAGGTG TTGTTGAACT TCCAGAAATA
AAAAAATTCT TAGAATCAAT TTCTTCTAGT CCTGCTGAGG CTTTTGTTAA TGAAAATGTT
GATGATATTG ATGAGATTAT TAATAAAATT ATTGATAAAA TAAAGAATTC AATAAATGAT
GGTACTGTTA TAGATCCTAT GGGTGATATG GTTTATATTG TTAAAAATGG AGAATTTAAT
AATGAGGATT ATAAGTTAAC GGCATCTAAT AATAAGTTAT TAGAAGGTGT AAAAGTAGGA
TATAACGAAA AAAATAGACA GATAGTTCTT ACAGGACTAA ATTTAGGTGA AAATGAATGG
GTTGAATTAA ATTATAAAGT AAGATTAAAT ACAAGCAACC CTGACTTTAA TGGGGATTTT
TGGTATCAAG CTAATAAAAG AACTGTTTTA AACCCTAATA ATAAAGAACC AAATATTTTT
CGTGACTTTG TAATACCTTC TGTTAGTGGA AAAAGACCAT CAATAGAAGT TAAATTAAAG
AAGATATCAA GTGAAACATC AAAGCCATTA GCTAATTCAG AATTTGAACT TTATAATTCT
ATAAATGAAA AATTAGGCTC TTTTACAACA AAAGAAAATG GGGAGGTAAG TCTTGGGTAT
TTACCAGAAG GTGAGTATAA GTTAAAAGAA ATAACCCCAC CAAAAGGATA TATTTTATCT
AAAGACTTTA TTAATTTTAA GATTAATAAT GGAAAGGCTA TACAAGATGG AAATGAAGTA
GAATTTATTA CGGTAAGTAA TAAAGTTAAT AGTATATGTA TAAAGAAAAC TGATGATGCA
AAGTTAGAAG CTGATGCTAA ATTTTTAAGT GGAGCAAAAT TTGAATTAAA TAAAGCTAAT
GATAAAAATT TCAAACCTTT AGTAAAAGAA ACTGATGATA AGGGAGAAAT AGAATTTAAT
GAGATTGAAC CTGGTACTTA CTATTTAAAA GAAGTTCTTG CTCCTAATGG ATATGAACAA
ATTAAGGAAG ACATAGGACC AATTGTAGTT GATAATACTG GTGTAGTAAC AATTCCTTGG
GATAAATTAA AAAGCAACGA TGTAGAAAAG TGGAATAATC AAGAGATTAT TCGTATAAAA
AATAAAAAGT TGAAATCGGC TGTATACATA GATAAAGTAG ATGCAATAAA TCAAGGAATA
AAGTTAAGTG GAGCAAAATT TTCACTTTAT ACTAATGATG AAAATTATAA AAATGATAAA
AAGCTAGTAA GAAATGGTGT AAATTATTAT TTAATAAGTG AAAAAGTATC AAATTATGAA
GGTAGAATTG AATGGGATAA TCTTAATTCA GGACAAGAAT ATAAATATCT TATTCAGGAG
ACTGAGGCAC CAAAGGGATA TACTGTAAGT GGAAAAGAAA TATTATTCCA TTTTAAAGAT
AATACTGTTG TTATAGATAA TGAGAATGAT GTAAAAGCAC TTGCTAGTAT AAATGGACAA
GTTATTAGTA TTAAAAATGC TAAAATATAT AAGCTTCCGT CATCTGGCGG TATCGGAGTG
TACCCTTTCT TACTTATAGG GACACTATGT ATGGCTTTAA GTTTAATATA TAGTTTTAAT
AGTAAGGTTT TAAATAAAAG GAAATAA
 
Protein sequence
MNSKGIDTKK LMSIISLIMT VIFLSILLPT NLTKAEEKSD SMSVEKVLNS EMYNNFNNEI 
LNNKSSINYT TDNNQGTWPV NWEYGNVSNK NKINKSVYGK NQPLEYNEGY LTKKAYTTDE
DNVFDINLKI QGKKNQSLKK DVVFLLDNSN SMTTNNRAIK IKEQIKNVMD KLNTNNTRYA
LVTYASTILD GRYYHLIDRS IGDNKYTVYK GYTSNQCYLN FTSNIQEIYN KIPTTVPNQR
NNGYVGGTFT QEGLLKAIEL LKNSDADEKI IIHLTDGLPT FSFLLKEFGG NEKAIFDYNT
QYNGIGVRGF GTSYFFNTKT QKPYIYSREE VYSALNRSIN KNESIWNNGF PTTLEAENIK
KENPDINIYT IGIELKKEVY KWDDYRKYYN AEGVVELPEI KKFLESISSS PAEAFVNENV
DDIDEIINKI IDKIKNSIND GTVIDPMGDM VYIVKNGEFN NEDYKLTASN NKLLEGVKVG
YNEKNRQIVL TGLNLGENEW VELNYKVRLN TSNPDFNGDF WYQANKRTVL NPNNKEPNIF
RDFVIPSVSG KRPSIEVKLK KISSETSKPL ANSEFELYNS INEKLGSFTT KENGEVSLGY
LPEGEYKLKE ITPPKGYILS KDFINFKINN GKAIQDGNEV EFITVSNKVN SICIKKTDDA
KLEADAKFLS GAKFELNKAN DKNFKPLVKE TDDKGEIEFN EIEPGTYYLK EVLAPNGYEQ
IKEDIGPIVV DNTGVVTIPW DKLKSNDVEK WNNQEIIRIK NKKLKSAVYI DKVDAINQGI
KLSGAKFSLY TNDENYKNDK KLVRNGVNYY LISEKVSNYE GRIEWDNLNS GQEYKYLIQE
TEAPKGYTVS GKEILFHFKD NTVVIDNEND VKALASINGQ VISIKNAKIY KLPSSGGIGV
YPFLLIGTLC MALSLIYSFN SKVLNKRK
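
The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way such a layout could be cleaned, summarized, and translated. The helper names are hypothetical, and the codon table is built by hand on the assumption that translation table 11 assigns the same amino acids as the standard code (the two differ in permitted start codons). Only the first line of the gene sequence is used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = clean(
    "ATGAATAGTA AGGGTATAGA TACCAAAAAA TTAATGTCAA TTATAAGCTT AATAATGACA"
)
print(translate(first_line))  # MNSKGIDTKKLMSIISLIMT
print(f"GC fraction of this fragment: {gc_fraction(first_line):.2f}")
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence would give a value to compare with the GC content field.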