Gene CPF_1785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1785 
Symbol 
ID4202823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2012076 
End bp2013428 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content32% 
IMG OID638082657 
ProductPTS system, sucrose-specific IIBC component 
Protein accessionYP_696221 
Protein GI110799103 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR01996] PTS system, sucrose-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000306537 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAG AACAAATAGT TGCTCAAGAA ATATTAAAAA ATATTGGTGG AAAAGAAAAT 
ATAAAATCAA TGGAACACTG TGCAACTAGA TTAAGACTTA TAGTTAAAGA TAAAAGTCTA
ATAAATGAAA AGGCAATTGA GAATATTGAT GGGGTAAGAG GACAATTTTT TGCTGCTGCT
CAATATCAAA TAATTCTAGG AACTGGGTTT GTAAATAAAG TCTTTGCAGC TATGAATGGT
GAAGGAGTAG AAACTGGGAA TGTAAAAGAA GATGCATATA GTGATATGAC TTTACCACAA
AAAATATCTC GTACTTTAGG AGATATTTTC GTTCCTATAA TTCCAGTATT AGTTGCAACA
GGTTTATTTA TGGGATTAAG AGGACTTTTA ACTAATTTAG GAGTTGAATT TAGTCCAACT
TTTAATACTT TATCAGAGGT TTTAACAGAT ACAGCATTTA TATTCTTACC AGCCTTAGTA
GCTTGGTCAA CAATGAAAAA ATTTGGTGGA ACACCTGTAG TTGGTATAGT ATTAGGGCTT
ATGCTTGTAG CCCCTCAGCT TCCAAATGCT TGGCAGGTAG CAGGAGGAGC AGATCCAATA
TATATTTCTT TATTAGGAAT AAGTATCCCT ATAGTAGGGT ATCAAGGATC AGTACTTCCA
GCTTTAGTAT TAGGTATTAT AGCAGCTAAA TTAGAAAAAT TTATTAGAAA GTTTATGCCA
GATGTACTAG ATTTAATATT TACTCCATTT TTAACTTTAT TAGTATCAAT GATTCTTGGA
CTTTTAGTAG TAGGTCCTAT AATGCATACA AGTGAAGTCT ACATATTAGA TTTATTTAAA
ATGTTTTTAA GCTTACCATT TGGAATAGGT GGAGCAATAA TTGGAGGAGT TCATCAAGTT
ATAGTTGTAA CTGGAGTTCA TCATATATTT AATGCTTTAG AAGTTGAATT AATTTCAAGT
ACAGGATTAA ATCCATTTAA TGCAGTAATA ACCGGGGCTA TAGTAGCTCA AGGAGCAGCA
GCTCTTGCAG TTGGATTTAA GACAAAAGAT AAGAAAAAAC GTTCATTATA TATTTCTTCA
GCAATACCAG CCTTTTTAGG AATTACAGAA GCGGCCATAT TTGGAGTAAA CTTAAGATTT
ATTAAACCAT TTATATTTGC ATGTATAGGT GGAGCTGCAT CAGGAATGTT TGCATCAATG
ATGAAATTGG CTGGAACTGG TATGGGAATA ACAGCTATAC CGGGAACACT GCTTTATATT
AATACAGGAT TAATTCAGTA CTTCATAACA ATTGCTATTG GATTTGCTAT TTCATTTGCA
TTAACATATA TATTCTTTAA ACCACAAGAA TAA
 
Protein sequence
MSKEQIVAQE ILKNIGGKEN IKSMEHCATR LRLIVKDKSL INEKAIENID GVRGQFFAAA 
QYQIILGTGF VNKVFAAMNG EGVETGNVKE DAYSDMTLPQ KISRTLGDIF VPIIPVLVAT
GLFMGLRGLL TNLGVEFSPT FNTLSEVLTD TAFIFLPALV AWSTMKKFGG TPVVGIVLGL
MLVAPQLPNA WQVAGGADPI YISLLGISIP IVGYQGSVLP ALVLGIIAAK LEKFIRKFMP
DVLDLIFTPF LTLLVSMILG LLVVGPIMHT SEVYILDLFK MFLSLPFGIG GAIIGGVHQV
IVVTGVHHIF NALEVELISS TGLNPFNAVI TGAIVAQGAA ALAVGFKTKD KKKRSLYISS
AIPAFLGITE AAIFGVNLRF IKPFIFACIG GAASGMFASM MKLAGTGMGI TAIPGTLLYI
NTGLIQYFIT IAIGFAISFA LTYIFFKPQE