Gene CPF_0538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0538 
Symbol 
ID4202883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp639643 
End bp641196 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content29% 
IMG OID638081420 
Productglycine betaine/L-proline ABC transporter, permease/glycine betaine/L-proline-binding protein 
Protein accessionYP_694992 
Protein GI110800299 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1174] ABC-type proline/glycine betaine transport systems, permease component
[COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.863672 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAATAGTT TATTACAATA TGTAATTTCA CAGAAAACTC AAATATTAGA TTTATTAGTT 
CAACATATAT ATTTAACAAT TACGGCTATA GGAATAGCTA TATTAATAGG AGTTCCTTTA
GGAATTCTTG TTTCAAGGGT TAAGTTTTTA AGAAAGCCTA TAATAGGATT TGTAAACTTA
GTACAAGCTG TGCCTTCTAT GGCCTTACTA GGATTGCTTA TTCCCATATT AGGAATAGGA
AGTACACCTG CAATCTTTAT GGTTGTAGTA TATTCATTGC TTCCAATAGT TAAAAATACT
TACACAGGAA TCTCTGGCAT AGACCCAGTA GTATTAGAAT CTGCTAAGGG AATTGGACTT
ACTAAGAACC AAAGTTTATT TAAGATACAA TTACCATTAG CATTGCCAAT AATAATGTCT
GGTATAAGAA TATCAGCAGT TACAGCCGTA GGGCTTATGA CTTTAGCAGC CTTTATTGGT
GCCGGTGGAC TTGGATACTT AGTATTCTCA GGAGTTCAAA CGGTAAATAA TAATATGATT
TTAGCAGGAG CTATACCAGC TTGTATTTTA GCCTTAATAG TTGACTTTAT TTTCGGAAAA
ATAGAGGTAG CTGTTACACC AAAGGGATTA AGTAATGATA ACAAAAAGAA AAATACTTTT
GTTCTTAAAA TAATAAGTGT AATAATGATA ATAGCTATTT TATTTATGGG AATTTCATCA
TTTATATCAA GTAAAAAAGA TAAAGTGGTT ATTGGATCAA AAAACTTTAC TGAGCAACTT
ATTTTAGGAA ATATGTATGC AGATTTAGTT CAGGATAAAA CAGATTTACA AGTTGAAAAA
AAGCTAAATC TAGGAGGAAC ATCAGTTGCC TTTGGAGCAC TTGAAAAAGG TGATGTTGAC
ATGTATGTTG ATTATACTGG AACATTACTT GTTAATGTTA TGAAAGAAAA TAATATTGAT
AATAGTGTAG ATTATTATAA TAGCATAAAA GAAAATATGA ACAAGGAACA TGGGTTAACA
GTAATGGAAC CCCTAGGTTT TAATAATACT TATAATATAG CTATATCAAA AGAATTAGCT
GATAAGTATA AGATAAATAC AATATCAGAT TTATCAAAGT ACAGTAATGA CTTTGTATTA
TCTCCAACTA TTGAGTTCCA AAATAGACAA GATGGTTTAG TTGGATTAAA GAATTACTAT
GGCATGGATT TCAAAAATGT TAAATCTTTA GATGGAAGTC TTAGATACTC AGCATTATCA
AATGGGGAAT CACAGGCTAT AGATGCTTTC TCAACAGATG GACTTCTTAA AAAGTTTGAT
TTAAAAACTT TAGAAGATGA TAAGAAATTC TTTGTAAATT ATAGTGCAGT ACCTATAGTT
AACAATAAGA CTTTAGAAAA ATATCCACAA TTAAAGGATG TTTTAAACTC TTTAAGTGGT
AAGATCAATG AAGAAAAAAT GATTGACTTA AACTATGAAG TAGATGTATT AGGTAAATCA
CCAGAAGAGG TTGCTAAAGC TTTCTTAATT AGAGAAGGTT TAATAGAACA ATAG
 
Protein sequence
MNSLLQYVIS QKTQILDLLV QHIYLTITAI GIAILIGVPL GILVSRVKFL RKPIIGFVNL 
VQAVPSMALL GLLIPILGIG STPAIFMVVV YSLLPIVKNT YTGISGIDPV VLESAKGIGL
TKNQSLFKIQ LPLALPIIMS GIRISAVTAV GLMTLAAFIG AGGLGYLVFS GVQTVNNNMI
LAGAIPACIL ALIVDFIFGK IEVAVTPKGL SNDNKKKNTF VLKIISVIMI IAILFMGISS
FISSKKDKVV IGSKNFTEQL ILGNMYADLV QDKTDLQVEK KLNLGGTSVA FGALEKGDVD
MYVDYTGTLL VNVMKENNID NSVDYYNSIK ENMNKEHGLT VMEPLGFNNT YNIAISKELA
DKYKINTISD LSKYSNDFVL SPTIEFQNRQ DGLVGLKNYY GMDFKNVKSL DGSLRYSALS
NGESQAIDAF STDGLLKKFD LKTLEDDKKF FVNYSAVPIV NNKTLEKYPQ LKDVLNSLSG
KINEEKMIDL NYEVDVLGKS PEEVAKAFLI REGLIEQ