Gene CPR_0521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0521 
Symbol 
ID4204141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp618058 
End bp619194 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content29% 
IMG OID642565078 
Productglycine betaine/carnitine/choline transport ATP-binding protein 
Protein accessionYP_697849 
Protein GI110802671 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1125] ABC-type proline/glycine betaine transport systems, ATPase components 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.691197 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAGAAA TAAAAAATGT ATCTAAACAA ATAGGGGACA GAAAGATACT AGATAATATT 
TCTTTAACTA TTGATAAAGG AAGTCTAGTT GTTTTAATAG GTTCTAGTGG ATGCGGTAAA
ACAACTACTT TAAAACTTAT AAATAAGTTA ATAACACCTA CTTCTGGTGA AATATTTATA
AATGGGAAAT CTTTAGCCAA GGAAAACCCT ATTGAATTAA GAAGAAATAT TGGATACGTA
ATTCAGAATA ATGGATTGTT TCCACATTTA ACAATAAAGG AAAATATAGA GCTTATTCCA
AGACTAAAAA AGGGCAAAGA TGTTGAAAAA ATAGAGCAGA CAACTTTAGA CCTTTTAAAT
ATGGTTGGAT TAGATCCTGA AATTTATTTA AATAAGTATC CATCAGAGCT TAGTGGAGGA
CAACAACAGA GAGTAGGTTT TGCAAGAGCC TTTGCTACAG ATGCTGAGAT AATTTTAATG
GATGAGCCCT TTAGTGCTTT AGATCCAATA ACTCGTACTT CACTTCAGGA AGAGCTTTTT
ACTATTCAAG AGGAACTTAA AAAGACAATT ATTTTTGTTA CTCATGATAT GGACGAAGCT
TTAAAGATAG CAGATAAAAT ATGCATAATG AATGGAGGAA AAATAGCTCA ATATGATACT
CCAGAAAATA TACTTAGAAA TCCAGCAAAT GATTTTGTTA GAGATTTTAT AGGATCAGAT
AGGGTTTGGA ATAATCCTGA CTTCATAAAA GCAAAGGATA TAATGATAAA AAATCCTGTA
TCTGTTAAAG GTGCTAGAAC CATATTACAG GGAATTGAAA TAATGAGAAG TAATAAGGTT
GATAGTTTAT TAGTTATTGA TAAAGAAAAT GTACTTAAAG GTATTGTTAC TTTTAAAGAT
ATAAAAATTA CAAATGAAAA ATCAAGAGTA CTTTCAGAAA TTATGAGTGA AAATCCACTA
AGAGTTAATG AAGATGATAG TTTGGTTGAT ATTTTAACTG TAATGAATGA AAACTCAGTA
GGATTTATAC CTGTTGTTAA CTCGGAAGAA AAATTGGTAG GGCTTATAAC AAGAAGTAGT
TTACTTTCAA TATTAAGCGA CCAATTCTTA GACATGGAGG TGAATATATT TGAATAG
 
Protein sequence
MIEIKNVSKQ IGDRKILDNI SLTIDKGSLV VLIGSSGCGK TTTLKLINKL ITPTSGEIFI 
NGKSLAKENP IELRRNIGYV IQNNGLFPHL TIKENIELIP RLKKGKDVEK IEQTTLDLLN
MVGLDPEIYL NKYPSELSGG QQQRVGFARA FATDAEIILM DEPFSALDPI TRTSLQEELF
TIQEELKKTI IFVTHDMDEA LKIADKICIM NGGKIAQYDT PENILRNPAN DFVRDFIGSD
RVWNNPDFIK AKDIMIKNPV SVKGARTILQ GIEIMRSNKV DSLLVIDKEN VLKGIVTFKD
IKITNEKSRV LSEIMSENPL RVNEDDSLVD ILTVMNENSV GFIPVVNSEE KLVGLITRSS
LLSILSDQFL DMEVNIFE