Gene CPF_2335 details


Gene Information

Locus tag: CPF_2335
Symbol:
ID: 4202670
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2596807
End bp: 2598255
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 32%
IMG OID: 638083200
Product: extracellular solute-binding protein
Protein accession: YP_696758
Protein GI: 110799804
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
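
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2596807
END_BP = 2598255
GENE_LENGTH_BP = 1449
PROTEIN_LENGTH_AA = 482


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                 # prints True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # prints True
```

Under those assumptions, 2598255 - 2596807 + 1 = 1449 bp, and 1449 / 3 - 1 = 482 aa, which agrees with the listed values.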


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTAAAAT TCAAAAAACT AATAGCTTTA ACAGCTTGTG CAATGTTAAC TACTTCAGTT 
GCATTAACTG GATGTGGAGC AGACAAAACA GCAAATGCTG GAGAGGGAGA AACAGTAAAA
CTTACTTGGT ATACAATTGG ACAAACACCA AAAGATTTAG ACATGGTTCA AGAAAAAGCT
AATGAATATT TAAAAGAAAA GATAAATGCT ACTATTGATA TGAAATTTAT TGATTACGGT
GATTACACTC AAAAAATGGG AGTTATAATA AATTCAGGAG AACCATATGA CTTAGCATTT
ACTTGTTCAT GGGCTAACCC ATATTTAGAA AATGCTAGAA AAGGAGCTTT CTTAGAAATA
GACGAATTAT TAGAAACTAA AGGTAAAGAA ATGAAGTCTG TTATAGATGA AAGATTCTGG
GAAGGTGCTA AAATAGATGG TAAAACATAT GCAGTTCCAA ACCAAAAGGA AATAGGAGTT
GCACCTATGT GGGTATTCAC TAAGGAATAT GTTGATAAAT ACAACATACC ATATCAAGAT
ATCCATACTT TAGAAGATTT AGAGCCATGG TTAAAAGTTA TCCATGAAAA TGAGCCAGAT
GTTACACCTT TATACATAAC AAAAGGATTC TCAGCACCAG CTTACTTCGA TCAATTAGTT
GATCCAGTAG GAGTTGAGTA TGGAGATGAA AGCTTAAAGA TTAAAAATAT GTTTGAAACA
GATAAAATGA AGAGCGAATT AGAAACTTTA CAAAAGTACT ATGATGCTGG ATATATAAAT
GCTGACTCAG CTACAGCTAA GGATGATAAA GCAGTTAAGA GATTTGTAAC TAAAGCTGAT
GGACAACCTT ATGCTGATGG ATTATGGTCA AAGGATTTAG GATATGAGGT AGTTTCATCA
CCAATAATGG ATACTCATAT TACTAATGGT TCAACTACAG GATCAATGAT AGCTATTTCT
AAAACTTCAG AGCATCCAGA AAAAGCTATG GAATTCTTAA ACTTATTAAA CACTGACGTA
TATTTAAGAA ACTTACTTAA CTATGGTATA GAAGGAACTC ACTATGAAAA AACTAGTGAT
ACTCAAATAA AATTAACTGA TAAAGCTAAA GACTACTCAG TTGGATACTA TACTTTAGGT
AACTTATTTA TAACTTATAC TTTAGATAAC GAACCAGTTG ATAAGTGGAA AGAATTCGAA
GCATTTAATG ATGCATCAGT TGAATCACCT GCTCTAGGAT TCAAATTTAA CACTGAAAAA
GTAAGTAACC AAATAGCTGC TATAAACAAC GTTCTTGAAG AGTTCAAGGC AACTATATAC
AGTGGATCAG TTAACGAAGC TGAATATTTA GACAAAATGA ACAAGAAATT AAAAGAAGTT
GGAATAGATG AAGTAATTTC AGAAATGCAA AGCCAAATAG ATGCATGGAA AGCTGAAAAT
GGAAAATAA
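
The GC content field can, in principle, be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is simply the share of G and C bases over all bases in the sequence; how the database itself derives and rounds the 32% figure is not stated in this record. The function name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace is ignored, so the 10-base blocks used in this record can be
    pasted as-is. Any other character (for example N) counts toward the
    total length but not toward the GC count.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Small demonstration on the first line of the gene sequence only;
# pass the full sequence to compare against the reported value.
first_line = "ATGTTAAAAT TCAAAAAACT AATAGCTTTA ACAGCTTGTG CAATGTTAAC TACTTCAGTT"
print(round(gc_percent(first_line), 1))
```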
 
Protein sequence
MLKFKKLIAL TACAMLTTSV ALTGCGADKT ANAGEGETVK LTWYTIGQTP KDLDMVQEKA 
NEYLKEKINA TIDMKFIDYG DYTQKMGVII NSGEPYDLAF TCSWANPYLE NARKGAFLEI
DELLETKGKE MKSVIDERFW EGAKIDGKTY AVPNQKEIGV APMWVFTKEY VDKYNIPYQD
IHTLEDLEPW LKVIHENEPD VTPLYITKGF SAPAYFDQLV DPVGVEYGDE SLKIKNMFET
DKMKSELETL QKYYDAGYIN ADSATAKDDK AVKRFVTKAD GQPYADGLWS KDLGYEVVSS
PIMDTHITNG STTGSMIAIS KTSEHPEKAM EFLNLLNTDV YLRNLLNYGI EGTHYEKTSD
TQIKLTDKAK DYSVGYYTLG NLFITYTLDN EPVDKWKEFE AFNDASVESP ALGFKFNTEK
VSNQIAAINN VLEEFKATIY SGSVNEAEYL DKMNKKLKEV GIDEVISEMQ SQIDAWKAEN
GK
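
The protein sequence corresponds to a translation of the gene sequence with translation table 11. As a rough sketch: table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as start codons. The code below does not handle alternative start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in the usual T, C, A, G ordering (TTT, TTC, TTA, TTG, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored. Codons containing unrecognized characters are
    reported as 'X'. Trailing bases that do not form a full codon are dropped.
    """
    bases = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE.get(bases[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGTTAAAAT TCAAAAAACT AATAGCTTTA"))  # prints MLKFKKLIAL
print(translate("GGAAAATAA"))                         # prints GK
```

The two demonstration outputs match the first ten and last two residues of the protein sequence listed in this record.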