Gene CPF_0721 details


Gene Information

Locus tag: CPF_0721
Symbol:
ID: 4202671
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 856777
End bp: 858861
Gene Length: 2085 bp
Protein Length: 694 aa
Translation table: 11
GC content: 29%
IMG OID: 638081606
Product: putative exo-alpha-sialidase
Protein accession: YP_695173
Protein GI: 110799574
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4409] Neuraminidase (sialidase)
TIGRFAM ID:
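
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 856777
end_bp = 858861
protein_length_aa = 694

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(gene_length_bp)                        # 2085
print(gene_length_bp % 3)                    # 0, the length is a whole number of codons
print(coding_codons == protein_length_aa)    # True
```

Under these assumptions the reported gene length (2085 bp) and protein length (694 aa) are consistent with each other.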


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTACA AAGGGATAAC TTTAATTTTA ACAGCTGCTA TGGTTATAAG TGGTGGTAAT 
TATGTATTGG TTAAAGGAAG TACTTTAGAC TCAGGGAAAA ATAATAGTGG GTATGAGATT
AAAGTAAATA ATAGTGAAAA TTTAAGTTCA CTAGGAGAAT ATAAGGATAT AAATTTAGAA
AGTTCAAATG CTTCTAATAT TACTTATGAT TTAGAAAAAT ATAAAAATTT AGATGAAGGT
ACTATTGTTG TAAGATTTAA TTCCAAGGAC TCTAAAATAC AAAGCTTATT AGGAATAAGT
AATAGCAAGA CTAAAAATGG ATATTTTAAT TTTTATGTTA CTAATTCAAG AGTTGGTTTT
GAGCTTAGAA ATCAAAAAAA TGAGGGGAAT ACACAAAATG GAACTGAGAA TCTAGTACAT
ATGTATAAAG ATGTTGCTTT AAATGATGGG GATAATACTG TTGCCTTAAA GATTGAAAAA
AATAAAGGTT ATAAGCTTTT CCTTAATGGA AAAATGATAA AAGAAGTTAA AGATACAAAT
ACAAAGTTTT TAAATAATAT AGAAAATTTA GATAGTGCTT TTATTGGAAA AACTAATAGA
TATGGACAAT CTAATGAATA TAATTTTAAG GGCAATATAG GTTTTATGAA TATATATAAT
GAACCTTTAG GCGATGATTA TTTACTTAGT AAGACTGGAG AGACAAAAGC TAAAGAAGAA
GTTCTTGTAG AGGGAGCTGT AAAGACAGAG CCAGTTGATT TATTTCATCC AGGATTTTTA
AATTCTAGTA ATTACAGAAT ACCGGCCTTA TTTAAAACAA AAGAAGGAAC TTTAATAGCT
TCAATAGATG CAAGAAGACA AGGGGGAGCT GATGCGCCTA ATAATGACAT AGATACAGCA
GTTAGAAGAA GTGAAGATGG AGGGAAAACT TGGGATGAAG GACAAATAAT AATGGACTAT
CCTGATAAAT CCTCTGTTAT AGATACAACT TTAATTCAAG ATGATGAAAC AGGAAGAATA
TTCTTGTTAG TAACTCATTT CCCTTCAAAA TATGGCTTTT GGAATGCTGG ATTAGGAAGT
GGATTTAAAA ATATTGATGG AAAAGAATAT TTATGTCTTT ACGATTCATC AGGTAAAGAA
TTTACTGTAA GAGAAAATGT AGTATATGAC AAAGATGGCA ATAAAACAGA ATATACAACA
AATGCTTTAG GAGATTTATT TAGAAATGGA ACCAAGATAG ATAATATAAA TTCTAGTACA
GCACCTTTAA AAGCAAAAGG GACAAGCTAT ATAAATCTTG TATATAGTGA TGATGATGGA
AAGACTTGGA GTGAGCCACA AAATATTAAT TTCCAAGTTA AAAAGGATTG GATGAAGTTT
TTAGGAATAG CTCCAGGTAG GGGAATACAA ATAAAAAATG GAGAACATAA AGGAAGAATA
GTAGTTCCTG TTTACTATAC AAATGAAAAA GGTAAACAAT CTAGTGCTGT AATATATAGT
GATGATAGTG GTAAGAATTG GACAATAGGA GAATCTCCAA ATGATAATAG AAAATTAGAA
AATGGAAAGA TAATAAATTC TAAAACCTTA TCAGATGATG CACCTCAATT AACTGAATGT
CAAGTAGTAG AAATGCCAAA TGGTCAATTA AAATTATTTA TGAGAAATTT AAGTGGATAT
TTAAATATTG CTACAAGTTT TGATGGGGGA GCTACTTGGG ATGAGACAGT AGAGAAAGAT
ACTAATGTTT TAGAGCCATA TTGCCAACTA AGTGTTATAA ATTATAGTCA AAAGATAGAT
GGAAAGGATG CTGTTATTTT CTCAAATCCA AATGCAAGAA GTAGATCAAA TGGTACTGTG
AGAATTGGAT TGATTAATCA AGTTGGAACA TACGAAAATG GTGAACCTAA ATATGAATTT
GATTGGAAAT ATAATAAATT AGTTAAACCA GGTTATTATG CTTATTCTTG CTTAACAGAA
TTAAGCAATG GAAATATTGG ATTATTATAT GAGGGTACAC CAAGTGAAGA AATGTCTTAT
ATTGAAATGA ATTTAAAATA TTTAGAGAGT GGAGCTAATA AATAA
 
Protein sequence
MNYKGITLIL TAAMVISGGN YVLVKGSTLD SGKNNSGYEI KVNNSENLSS LGEYKDINLE 
SSNASNITYD LEKYKNLDEG TIVVRFNSKD SKIQSLLGIS NSKTKNGYFN FYVTNSRVGF
ELRNQKNEGN TQNGTENLVH MYKDVALNDG DNTVALKIEK NKGYKLFLNG KMIKEVKDTN
TKFLNNIENL DSAFIGKTNR YGQSNEYNFK GNIGFMNIYN EPLGDDYLLS KTGETKAKEE
VLVEGAVKTE PVDLFHPGFL NSSNYRIPAL FKTKEGTLIA SIDARRQGGA DAPNNDIDTA
VRRSEDGGKT WDEGQIIMDY PDKSSVIDTT LIQDDETGRI FLLVTHFPSK YGFWNAGLGS
GFKNIDGKEY LCLYDSSGKE FTVRENVVYD KDGNKTEYTT NALGDLFRNG TKIDNINSST
APLKAKGTSY INLVYSDDDG KTWSEPQNIN FQVKKDWMKF LGIAPGRGIQ IKNGEHKGRI
VVPVYYTNEK GKQSSAVIYS DDSGKNWTIG ESPNDNRKLE NGKIINSKTL SDDAPQLTEC
QVVEMPNGQL KLFMRNLSGY LNIATSFDGG ATWDETVEKD TNVLEPYCQL SVINYSQKID
GKDAVIFSNP NARSRSNGTV RIGLINQVGT YENGEPKYEF DWKYNKLVKP GYYAYSCLTE
LSNGNIGLLY EGTPSEEMSY IEMNLKYLES GANK
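
As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates the first and last blocks of the gene sequence shown above. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted). The `translate` helper and the variable names are hypothetical and written for this example; they are not part of any database tool.

```python
from itertools import product

# Codon table built from the standard compact layout (bases ordered T, C, A, G).
# Translation table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence listed above.
first_block = "ATGAATTACA AAGGGATAAC TTTAATTTTA"
last_block = "GGAGCTAATA AATAA"

print(translate(first_block))  # MNYKGITLIL
print(translate(last_block))   # GANK
```

The two results match the first ten and last four residues of the protein sequence listed above; the final TAA codon is a stop codon and contributes no residue.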