Gene CPF_1799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1799 
Symbol 
ID4203112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2029106 
End bp2031094 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content29% 
IMG OID638082669 
Productmolybdopterin oxidoreductase 
Protein accessionYP_696233 
Protein GI110801191 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.381275 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGTAT TAAGTCATGG GTGTACATTA GATTGCTTTG ATTGTTGTAA ATTTAATGTT 
TATAAAGAAG GAAGTGAAAT TCTAAAAATA GAAGGTGACA AAGAGCATCC ATTTACAAAG
GGACTCATAT GTAAAAAAGG AGTAGCTCAC TTAAACAGAT TGAATCATAA AGATAGAATA
TATACTCCTC TTTTAAAGAA TAATGGAGTA TGGGAAGAGA TTTCCTTTGA AGATGCCTTA
GAAATAATGA AAGAGAAACT TGAATATACA AAAGAGAAAT ATTCTTCTAA GTCAATATTA
TATTATAGCC AATATGGAAG TGGAGGAGTA CTAAAGGGAA TAGAGGATAT ATTCTTTAAT
TTTTATGGTG GTGTAAGTAA AGCTACAGGA GGTCCTTGTT GGAGTGCTGG AATGAGAGCT
CAAAAATATG ATTTTGGAGA TTCTGTATCA AATTCCTTAG AGGATATGAT AAATAGTAAA
AACATATTTT TATGGGGTAA AAATCCTGCA AATACAACCA TACATACTAT GGCGATTTTA
AATAAGGCAA AGAAAAACGG GAGCAGAATA ATAGTTATAG ATCCAATAAA TACTCAAAGT
GCAAAGCTTG GAGATATTCA TGTGAAAATT AAACCAGGGA CAGATGGAGC TTTAGCTATG
GCTATGGCAA AAATAATAAT TTCTAAAGGT CTTCAAGATA AGGATTTTAT AAATAAATAT
GTTTTAGGAT TCCAAGAATA TAAAGATCAT TTAGAGAATT TTGATTTGGA TTATTTAAGT
GATGAATGTG GAATAGAAAT AGAGGATATA GAAAAGCTAA CTAAATACTA TTGTGAAAAA
AATTCTAGCA TATATTTGGG ATATGGAATG CAAAAATATA AGAATGGTGG AAATACCATA
AGAGCTATTG ATGCCTTAGG TGCTTTAACT GGTCAAATTG GAGTTAAAGG CGGTGGAGTA
AATTACGCAA ATAAGGTATT AAGTAGGATT TTAGATTCAG ATCCCTTTAA AAGTGGAGAA
GTTGGAGAAA ATAGAGAATT TTATGTCTCT AATATAAATG AGTTTATAGA AGAACCTAAA
AAATATTCTT TAAGTGTAGA AGATTCTAAT GCACCTATAA AAATCATGGT AATAGCTAAT
AGCAATCTCA TGAATCAACT TCCAAACTTA AATAGATTAA ATAATAGTAT AGATAAAGTT
GAGTTTAAAG TTTGTTTTGA TATGTTTATG ACAGATACTG CTTCAAAGTG TGATTTATTC
ATTCCTTGTA CCAATACCTT AGAAAGCGAG GATATGGTTT TTAGTTCAAT GACTAATCCA
TATTTAATAT ATAATGAAAA GATAATAGAA CCTAGAGAAA AACTTATGGA TGAATACTAC
TTTTTTAGAG AGTTAGCTAA AAGAATGAAT TTAAAAGGAT ATCCAAGCCT ATCTAAAAAA
GATTATTTAA GTAAGGTTAT TGAACCTTTA ATGAGATATA ATAAAGATAT AACCTTAGAT
TATTTAAAAA ATAATCCTTT TACAGTTTAT GATGATGTAG CTTGGGAAAA TAAAAAGTTT
AAAACACCTT CTGGCAAATT TGAACTTGCA TCTAAAAGAG CTTTAAAAGA GTGCGGAAGC
TTAACACCAA CATACTTAAG TCCTAGAATA AAAGAAAACT GTTTTAGATT GCTTACTAAT
CATTCTAAAG ATTCATTATC AAGTCAGCAT TATATAGATG TAGATGAAAA GGCAAAAGTA
TATTTAAATG AGAATATGAT AAGAAAGTTT TCCTTAATTT GCGGAGAAAA GGTTAAATTA
AAATCAAGGA CTGGAGAGAT TACAGCAATT TGTTCCTTGG ATAATGGGGT TCAAGATTAT
GTGGCTTTAA TGTATGTTGG TTGGTGGAAA AAACATGGGA ATCCAAACTT TTTAACTGAA
TCAGGAATCT CTGATATGGG TGGACAAATA ACATATAATG AAACCTTTAT AGAGATTGAA
AATATATAA
 
Protein sequence
MEVLSHGCTL DCFDCCKFNV YKEGSEILKI EGDKEHPFTK GLICKKGVAH LNRLNHKDRI 
YTPLLKNNGV WEEISFEDAL EIMKEKLEYT KEKYSSKSIL YYSQYGSGGV LKGIEDIFFN
FYGGVSKATG GPCWSAGMRA QKYDFGDSVS NSLEDMINSK NIFLWGKNPA NTTIHTMAIL
NKAKKNGSRI IVIDPINTQS AKLGDIHVKI KPGTDGALAM AMAKIIISKG LQDKDFINKY
VLGFQEYKDH LENFDLDYLS DECGIEIEDI EKLTKYYCEK NSSIYLGYGM QKYKNGGNTI
RAIDALGALT GQIGVKGGGV NYANKVLSRI LDSDPFKSGE VGENREFYVS NINEFIEEPK
KYSLSVEDSN APIKIMVIAN SNLMNQLPNL NRLNNSIDKV EFKVCFDMFM TDTASKCDLF
IPCTNTLESE DMVFSSMTNP YLIYNEKIIE PREKLMDEYY FFRELAKRMN LKGYPSLSKK
DYLSKVIEPL MRYNKDITLD YLKNNPFTVY DDVAWENKKF KTPSGKFELA SKRALKECGS
LTPTYLSPRI KENCFRLLTN HSKDSLSSQH YIDVDEKAKV YLNENMIRKF SLICGEKVKL
KSRTGEITAI CSLDNGVQDY VALMYVGWWK KHGNPNFLTE SGISDMGGQI TYNETFIEIE
NI