Gene CPF_0541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0541 
SymboltreB 
ID4203920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp643462 
End bp644883 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content34% 
IMG OID638081423 
ProductPTS system, trehalose-specific IIBC component 
Protein accessionYP_694995 
Protein GI110800828 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR01992] PTS system, trehalose-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000364471 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAGT TTGAAAAAGA TGCAAAGTTA CTAACAGAGT ACATAGGCGG AAAAGAAAAC 
GTAGCTGCAG TTACTCACTG TGCTACAAGA ATGAGATTCG TTTTAAATGA TACAAGCAAA
GCAGATGTTG AAAAAATAAA AGCTATTCCT TGTGTAAAAG GAACTTTTAC ACAAGCAGGA
CAGTTTCAAG TAATAATTGG ACCAGAAGTT GCAACTTTTT ATAATGATTT TGTTGGACAG
ACTGGAGTAT CTGGAGAAAG TAAGGAAGAA GTCAAGAAGG CTGCTAAGAA GAATATGAAT
ATAGTTCAGA GAGCAGTTGC AGGCTTAGCA GAAATATTTG CTCCGTTAAT ACCAGCTATC
ATAGTTGGAG GTCTTATATT AGGATTCCGT AATGTAATCG GAGATATGAA ATTATTTGAA
GATGGAACTA AGAGTTTAGT TGAAATTTCT CAATTCTGGG CAGGAACACA TAGTTTCTTA
TGGTTAATAG GAGAGGCAAT ATTCCACTTC TTACCTGTAG GAGTTGTTTG GTCAATTGCT
AAGAAAATGG GAGCAGATCA AATGCTAGGA ATAGTTATTG GTATAACTTT AGTATCTCCA
CAATTATTAA ATGCATATAG TGCAGGTAAT GGAGCCGCTG CTCCAGTTTG GGACTTTGGA
TTTGCTCAAG TTCCAATGAT AGGATATCAA GCACAAGTTT TACCAGCAAT CATGGTTGGA
TTCACTTTTG TTTACTTAGA AAGATTATTT AAGAAGATTA CACCAGGTCC TATACAAATG
ATAATAGTAC CATTTTTCTC AGTAATTCCA ACAGTATTAT TAGCACACAC TGTTTTAGGA
CCTATAGGTT GGAAAATAGG TAGCGTAATA TCAGGAGCTA TAGTGGCAGG CTTAACATCA
TCATTTGGAT GGTTATTTGC AGGAATTTTT GGAATGATTT ATGCACCATT AGTTATAACA
GGATTACATC ATACTTTATT ACCAGTTGAT TTACAACTTA TAAGTGATAT AGGTGGAACA
TTCTTATGGC CAATAATTGC ATTATCAAAT ATAGCTCAAG CATCAGCAGT TTTAGCTATG
ATATATGTAA ATAGAAAAGA TGAAGATGAA AAACAAATAT CAATACCAGC ATGTATTTCA
GGATATTTAG GGGTAACTGA ACCTGCAATG TTTGGGGTTA ACTTAAAATA TCTATATCCA
TTCATAGCTT CTATGATTGG AGCAGGAGTT TCTGGTATGT TCTCAATGGC AATGGGATGT
ATGGCAAACT CAGTAGGAGT TGGTGGATTA CCAGCAATCT TATCAATGCA ATCATCTAGT
ATGTTAATGT ATTTAGTTGC AATGGGAATA GCAATAGTTG TACCATTCAT CTTAACAATA
GTGTTCTCAA AGACAAAATT AGCTAATATG GCTTCAAAAT AA
 
Protein sequence
MSKFEKDAKL LTEYIGGKEN VAAVTHCATR MRFVLNDTSK ADVEKIKAIP CVKGTFTQAG 
QFQVIIGPEV ATFYNDFVGQ TGVSGESKEE VKKAAKKNMN IVQRAVAGLA EIFAPLIPAI
IVGGLILGFR NVIGDMKLFE DGTKSLVEIS QFWAGTHSFL WLIGEAIFHF LPVGVVWSIA
KKMGADQMLG IVIGITLVSP QLLNAYSAGN GAAAPVWDFG FAQVPMIGYQ AQVLPAIMVG
FTFVYLERLF KKITPGPIQM IIVPFFSVIP TVLLAHTVLG PIGWKIGSVI SGAIVAGLTS
SFGWLFAGIF GMIYAPLVIT GLHHTLLPVD LQLISDIGGT FLWPIIALSN IAQASAVLAM
IYVNRKDEDE KQISIPACIS GYLGVTEPAM FGVNLKYLYP FIASMIGAGV SGMFSMAMGC
MANSVGVGGL PAILSMQSSS MLMYLVAMGI AIVVPFILTI VFSKTKLANM ASK