Gene CPF_0771 details


Gene Information

Locus tag: CPF_0771
Symbol:
ID: 4203743
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 918285
End bp: 919934
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 29%
IMG OID: 638081655
Product: cell wall surface anchor family protein
Protein accession: YP_695222
Protein GI: 110799961
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4932] Predicted outer membrane protein
TIGRFAM ID: [TIGR01167] LPXTG-motif cell wall anchor domain
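
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon, neither of which the record states. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 918285
END_BP = 919934


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # 1650 549
```

Under those assumptions the result agrees with the reported Gene Length (1650 bp) and Protein Length (549 aa).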


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.245265
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAAAA GAAATTTTTT AATCAAAAAA ATGATATCAT TGTTAGCTGT TTTTGCTATG 
TTTCTTAGCT TTGGCTCGCC ATTAACAAGT AAAGTAGTAA AAGCTGCTGG ACAAGTACCT
AGTGATAATG ATACAGCATT AGGAACAGTA CACAATGTAG CAGAAGGAAG TACAGTAACT
GCATATCAAA TTGTAAAAGG AAATTATAAT GAGAATGGAT TTATCGGTTA CGTTTTTAAT
AGTGCTTTAG GTGGGAATTT AAAAATTGCT GATCCAACAA AACCAACTCA AGAAGAAATT
CTTGCAATTG CAAAAGATAC TAATGCTTTA AATAGTTTAC CAGCTAATGA AAAAATTACA
ATGACTCCTG GAGAAGATAA AACAAATTAT ACAGCAAATT TAAATGCAGG TTATTGGTTA
GTAATAGCTA CTCCTAAGGA TAAAGGTACT ACTATTTATA ATCCAATGTT ATTAGGTATT
TATTACAATA AAGGTGGTAG CGATAATTCA ATGGAACAAG GCTTAATTGA TGCTAACACA
AATTGGAACC TAAATAGTAC TGAAGCATGG GCTAAAAAAA TAGAGCCTAC TATAAAAAAG
GAAATTGAAA ATCCATACAA TGGTAATAAT AAAGGTGCAG ACCAAGCTGT TGGTGAAAAT
TTTAATTTCA AAGTAACTGC ACTTATTCCA GGATATTCAA AAGAATATAC TAATGTGAAA
TATATTATTA CAGATAAATT AAGTGAAGGA TTAGATTACA ATGATGATTC TATCAAGGTA
ACTGTAAATG GACAAGTCGT TCAAGAAGGA GAAAATACTT TTGAATTTAT AAAAACAGAT
GCTCAAAACA TAAAAATAAA TTTCAATTCT AATTATATTC TTGAAAATGG TGGAAAAGAA
GTTATTGTTA CATATAGTGC AAAATTAAAC AATAATGCAA AATACAACTT TGATCCAAAT
ACAAATGATG TTACTTTAGA ATATAGTAAT AATCCAGATA CAACAAAAGA TAGTACTAAG
ATACATGATC GTACTTATCA ATATACATTT GGTATAGATG CTGATTTATT TGGTTCAAGA
CAAGATAAAA ATAAGGTGAC AAGAGAAATT ATAAAAGTTG ATGAAACTGG TAAAGTTTTA
GATCAACAAA CAGAAAATTT AATGGAAACT GGTCAAAAGG TAGAAGGGCC TTTATCTGGA
GCTGAATTTA CATTAACACC TAAAAATGGA ACTCCGGGAA AAGTACTTAC TGCAATTTCA
GATAATGGTG GTAAATTAAA ATTTACTGGT TTAGCTACTG GTGAGTATGA GTTAGTGGAA
ACTAAAGCAC CAGAAGGATA TGCTTTAAAT AACACACCTC AAAAAGTAAA GATTACTGCT
ACATATAATC CAGAAGGTAC TTTAAAAACT TATAGTATAA CAATTAATGA TCAAAATACA
TCGACATATA CAGCAACCTA TGAACAAGAT GGAAGTATTA AAGAAATTAA AGATGATACA
AAAACTTCAA TAATCAAAAA TACAAAATTA AGTCAATTAC CATCTACAGG TGGAATGGGT
ACATATATAT TCACTTTTGT AGGGGTTACC TTAATGGCTG GTGCTATAGG AAGCCACTTC
TTATTTGGTA AGAAAAAATC AAGAGTATAG
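
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. A minimal sketch of how one might do that and recompute a GC fraction to compare with the reported GC content; the helper names are hypothetical, and only the first sequence line is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence above; paste all lines for the full gene.
    first_line = (
        "ATGGATAAAA GAAATTTTTT AATCAAAAAA ATGATATCAT TGTTAGCTGT TTTTGCTATG"
    )
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_fraction(seq), 3))
```

Running the same two functions on the complete sequence gives a value that can be compared with the GC content field.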
 
Protein sequence
MDKRNFLIKK MISLLAVFAM FLSFGSPLTS KVVKAAGQVP SDNDTALGTV HNVAEGSTVT 
AYQIVKGNYN ENGFIGYVFN SALGGNLKIA DPTKPTQEEI LAIAKDTNAL NSLPANEKIT
MTPGEDKTNY TANLNAGYWL VIATPKDKGT TIYNPMLLGI YYNKGGSDNS MEQGLIDANT
NWNLNSTEAW AKKIEPTIKK EIENPYNGNN KGADQAVGEN FNFKVTALIP GYSKEYTNVK
YIITDKLSEG LDYNDDSIKV TVNGQVVQEG ENTFEFIKTD AQNIKINFNS NYILENGGKE
VIVTYSAKLN NNAKYNFDPN TNDVTLEYSN NPDTTKDSTK IHDRTYQYTF GIDADLFGSR
QDKNKVTREI IKVDETGKVL DQQTENLMET GQKVEGPLSG AEFTLTPKNG TPGKVLTAIS
DNGGKLKFTG LATGEYELVE TKAPEGYALN NTPQKVKITA TYNPEGTLKT YSITINDQNT
STYTATYEQD GSIKEIKDDT KTSIIKNTKL SQLPSTGGMG TYIFTFVGVT LMAGAIGSHF
LFGKKKSRV
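
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is not the database's own pipeline: it uses the standard codon assignments, which translation table 11 shares (the tables differ in which codons may initiate translation), and it stops at the first stop codon. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is ignored; ambiguous bases such as N raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First and last 30 bases of the gene sequence above.
    print(translate("ATGGATAAAA GAAATTTTTT AATCAAAAAA"))  # MDKRNFLIKK
    print(translate("TTATTTGGTA AGAAAAAATC AAGAGTATAG"))  # LFGKKKSRV
```

Both fragments match the corresponding ends of the protein sequence listed above.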