Gene CPR_0522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0522 
Symbol 
ID4206096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp619187 
End bp620740 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content29% 
IMG OID642565079 
Productglycine betaine/carnitine/choline ABC transporter, permease/substrate-binding protein 
Protein accessionYP_697850 
Protein GI110803807 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1174] ABC-type proline/glycine betaine transport systems, permease component
[COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.271995 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAATAGTT TATTACAATA TGTAATTTCA CAGAAAACTC AAATATTAGA TTTATTAGTT 
CAACATATAT ATTTAACAAT TACGGCTATA GGAATAGCTA TATTAATAGG AGTTCCTTTA
GGAATTCTTG TTTCAAGGGT TAAGTTTTTA AGAAAGCCTA TAATAGGATT TGTAAACTTA
GTACAAGCTG TGCCTTCTAT GGCCTTACTA GGATTGCTTA TTCCCATATT AGGAATAGGA
AGTACACCTG CAATCTTTAT GGTTGTAGTA TATTCATTGC TTCCAATAGT TAAAAATACT
TACACAGGAA TCTCTGGCAT AGACCCAGTA GTATTAGAAT CTGCTAAGGG AATTGGACTT
ACTAAGAATC AAAGTTTATT TAAGATACAA TTACCATTAG CATTGCCAAT AATAATGTCT
GGTATAAGAA TATCAGCAGT TACAGCCGTA GGGCTTATGA CTTTAGCAGC CTTTATTGGT
GCCGGTGGAC TTGGATACTT AGTATTCTCA GGAGTTCAAA CGGTAAATAA TAATATGATT
TTAGCAGGAG CTATACCAGC TTGTATTTTA GCCTTAATAG TTGACTTTAT TTTTGGAAAA
ATAGAGGTAG CTGTTACACC AAAGGGATTA AGTAATGATA AGAAAAAGAA AAATACTTTT
GTTCTTAAAA TAATAAGTGT AATAATGATA ATAGCTATTT TATTTATGGG AATTTCATCA
TTTATATCAA GTAAAAAAGA TAAAGTGGTT ATTGGATCAA AAAACTTTAC AGAGCAACTT
ATTTTAGGAA ATATGTATGC AGATTTAGTT CAGGCTAAAA CAGATTTACA AGTTGAAAAA
AAGTTAAATC TAGGAGGAAC ATCAGTTGCC TTTGGAGCAC TTGAAAAAGG TGATGTTGAC
ATGTATGTTG ATTATACTGG AACATTACTT GTTAATGTTA TGAAAGAAAA TAATATTGAT
AATAGTGTAG ATTATTATAA TAGCATAAAA GAAAATATGA ACAAGGAACA TGGGTTAACA
GTAATGGAAC CCCTAGGTTT TAATAATACT TATAATATAG CTATATCAAA AGAATTAGCT
GATAAGTATA AAATAAATAC CATATCAGAT TTATCAAAGT ACAGTAATGA CTTTGTATTA
TCTCCAACTA TTGAGTTCCA AAATAGACAA GATGGTTTAG TTGGATTAAA GAATTACTAT
GGCATGGATT TCAAAAATGT TAAATCTTTA GATGGAAGTC TTAGATACTC AGCATTATCA
AATGGGGAAT CACAGGCTAT AGATGCTTTC TCAACAGATG GACTTCTTAA AAAGTTTGAT
TTAAAAACTT TAGAAGATGA TAAGAAATTC TTTGTAAATT ATAGTGCAGT ACCTATAGTT
AACAATAAGA CTTTAGAAAA ATATCCACAA TTAAAGGATG TTTTAAACTC TTTAAGTGGT
AAGATCAATG AAGAAAAAAT GATTGACTTA AACTATGAAG TAGATGTATT AGGTAAATCA
CCAGAAGAGG TGGCTAAAGC TTTCTTAATT AGAGAAGGTT TAATAGAACA ATAG
 
Protein sequence
MNSLLQYVIS QKTQILDLLV QHIYLTITAI GIAILIGVPL GILVSRVKFL RKPIIGFVNL 
VQAVPSMALL GLLIPILGIG STPAIFMVVV YSLLPIVKNT YTGISGIDPV VLESAKGIGL
TKNQSLFKIQ LPLALPIIMS GIRISAVTAV GLMTLAAFIG AGGLGYLVFS GVQTVNNNMI
LAGAIPACIL ALIVDFIFGK IEVAVTPKGL SNDKKKKNTF VLKIISVIMI IAILFMGISS
FISSKKDKVV IGSKNFTEQL ILGNMYADLV QAKTDLQVEK KLNLGGTSVA FGALEKGDVD
MYVDYTGTLL VNVMKENNID NSVDYYNSIK ENMNKEHGLT VMEPLGFNNT YNIAISKELA
DKYKINTISD LSKYSNDFVL SPTIEFQNRQ DGLVGLKNYY GMDFKNVKSL DGSLRYSALS
NGESQAIDAF STDGLLKKFD LKTLEDDKKF FVNYSAVPIV NNKTLEKYPQ LKDVLNSLSG
KINEEKMIDL NYEVDVLGKS PEEVAKAFLI REGLIEQ