Gene CPF_0468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0468 
Symbol 
ID4203156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp554774 
End bp556003 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content28% 
IMG OID638081351 
ProductUDP-glucose/GDP-mannose dehydrogenase family protein 
Protein accessionYP_694924 
Protein GI110799320 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0677] UDP-N-acetyl-D-mannosaminuronate dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0698877 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT TAAATATTAT TGGATTAGGA TATATAGGTT TGCCAACAGC ATTATCATTT 
GCAAGTGCAG GAATTGAAGT CATAGGAACA GATTATAACA AAGAAATAGT TAATACATTA
AATAATGGAC GAGTTACTTT TAAAGAAGAT GGGCTGTATA ACTTATACAA GAAGGCTTTA
AATAAAGGAA TAGAGTTTTC AACTGAGTAT GCTAAAACTA ACAAATACAT AATAACTGTA
CCTACACCAT ATATAAAAGA AAGCAAAAAA TTAGATGCTA GGTATGTTAT TTCTGCAGTA
GAAAGTGTAT TAGAAGTTTG TGATAAAGGG ACAATTCTTG TAATAGAATC AACAGTTTCT
CCAGGTACTA TGGATAAGTT TGTAAGGCCA ATAGTTGAAA ATAGAGGGTT TGTAATTGGA
GAGGATATAC ATTTAGTACA TGCTCCAGAA AGGATAATAC CAGGAAATAT GGTTAATGAG
TTGAAAAATA ACTCAAGAAC TATAGGTTCT GATAATATTG AAATAGGTTA TGAAGTTAAA
AGATGGTATG AAACTTTCTG TAAGGGTGAG ATAGTTGTTA CAGATATTAA AACAGCTGAA
CTTTCAAAAG TTGTTGAAAA TACATTTAGA GATATAAATA TAGCTTTTGC AAATGAGCTT
TCAAGAATAT GTAGAAAAGA GAATATGGAT GTTTATGAAT TAATAAATAT AGCTAATAAA
CATCCTAGAG TTAATATACT TACTCCAGGT CCAGGAGTTG GAGGACATTG TATATCAGTA
GATCCGTGGT TTTTAGTGGG GGATTATCCT GAAATAGTAA ATATAATTAA GGCTGCTAGA
GAGGTTAATG ATTCACAACC GGATTTTGTT ATAGATAGAA TAAGGGATAT CATGAATGAA
AATGGAATAA CTGATATATC AAAAGTTGGT TTATATGGGC TTACATATAA AGAAAATGTT
GATGACTTAA GAGAAAGTCC AACACTTCAA ATATTAGAAA AATTAGAACG ATATTTTGTA
AAAGGAATAA AACTATATGA TCCTTGGATA GAAGAAAAAA TATTTGAAAA TCAATATAAT
AATTTTAATG AGTTTTTAGA TAATATTGAA TTGGTAGTAG TTCTTGTAGC ACATGATCAT
ATAAAAGAAT TTAAAGACAT GCTTGAGGGA AAGTTAATTT TTGATACTAA AAATATATTG
AGAAATTTAA ATGGAATATA TAAGTTATAG
 
Protein sequence
MKKLNIIGLG YIGLPTALSF ASAGIEVIGT DYNKEIVNTL NNGRVTFKED GLYNLYKKAL 
NKGIEFSTEY AKTNKYIITV PTPYIKESKK LDARYVISAV ESVLEVCDKG TILVIESTVS
PGTMDKFVRP IVENRGFVIG EDIHLVHAPE RIIPGNMVNE LKNNSRTIGS DNIEIGYEVK
RWYETFCKGE IVVTDIKTAE LSKVVENTFR DINIAFANEL SRICRKENMD VYELINIANK
HPRVNILTPG PGVGGHCISV DPWFLVGDYP EIVNIIKAAR EVNDSQPDFV IDRIRDIMNE
NGITDISKVG LYGLTYKENV DDLRESPTLQ ILEKLERYFV KGIKLYDPWI EEKIFENQYN
NFNEFLDNIE LVVVLVAHDH IKEFKDMLEG KLIFDTKNIL RNLNGIYKL