Gene CPF_0537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0537 
Symbol 
ID4203314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp638514 
End bp639650 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content29% 
IMG OID638081419 
Productglycine betaine/L-proline transport, ATP-binding protein 
Protein accessionYP_694991 
Protein GI110801237 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1125] ABC-type proline/glycine betaine transport systems, ATPase components 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.418471 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAGAAA TAAAAAATGT ATCTAAACAA ATAGGGGACA GAAAGATACT AGATAATATT 
TCTTTAACTA TTGATAAAGG AAGTCTAGTT GTTTTAATAG GTTCTAGTGG ATGCGGTAAA
ACAACTACTT TAAAACTTAT AAATAAGTTA ATAACACCTA CTTCTGGTGA AATATTTATA
AATGGGAAAT CTTTAGCCAA GGAAAACCCT ATTGAATTAA GAAGAAATAT TGGATACGTA
ATTCAGAATA ATGGATTATT TCCACATTTA ACAATAAAGG AAAATATAGA GCTTATTCCA
AGACTAAAGA AAGGCAAAGA TGTTGAAAAA ATAGAGAAGA CAACTTTAGA CCTTTTAAAT
ATGGTTGGAT TGGATCCTGA AATTTATTTA AATAAGTATC CATCAGAGCT TAGTGGAGGA
CAGCAACAGA GAGTAGGTTT TGCAAGAGCC TTTGCTACAG ATGCTGAGAT AATTTTAATG
GACGAGCCCT TTAGTGCTTT AGATCCAATA ACTCGTACTT CACTTCAGGA AGAGCTTTTT
AATATTCAAG AGGAACTTAA AAAGACAATT ATTTTTGTTA CTCATGATAT GGATGAAGCT
TTAAAGATAG CAGATAAAAT ATGCATAATG AATGGAGGAA AAATAGCTCA ATATGATACT
CCAGAAAATA TACTTAGAAA TCCAGCAAAT GATTTTGTTA GAGATTTTAT AGGATCAGAT
AGGGTTTGGA ATAATCCTGA CTTCATAAAA GCAAAGGATA TAATGATAAA AAATCCTGTA
TCTGTTAAAG GTGCTAGAAC CATATTACAA GGAATCGAAA TAATGAGAAG TAATAAGGTT
GATAGTTTAT TAGTTATTGA TAAAGAAAAT GTACTTAAAG GTATTGTTAC TTTTAAAGAT
ATAAAAATTA CAAATGAAAA ATCAAGAGCA CTTTCAGAAA TTATGAGTGA AAATCCACTA
AGAGTTAATG AAGATGATAG TTTAGTTGAT ATTTTAACTG TAATGAATGA GAACTCAGTA
GGATTTATAC CTGTTGTTAA CTCAGAAGAA AAATTGGTAG GGCTTATAAC AAGAAGTAGT
TTACTTTCAA TATTAAGCGA CCAATTTTTA GACATGGAGG TGAATATATT TGAATAG
 
Protein sequence
MIEIKNVSKQ IGDRKILDNI SLTIDKGSLV VLIGSSGCGK TTTLKLINKL ITPTSGEIFI 
NGKSLAKENP IELRRNIGYV IQNNGLFPHL TIKENIELIP RLKKGKDVEK IEKTTLDLLN
MVGLDPEIYL NKYPSELSGG QQQRVGFARA FATDAEIILM DEPFSALDPI TRTSLQEELF
NIQEELKKTI IFVTHDMDEA LKIADKICIM NGGKIAQYDT PENILRNPAN DFVRDFIGSD
RVWNNPDFIK AKDIMIKNPV SVKGARTILQ GIEIMRSNKV DSLLVIDKEN VLKGIVTFKD
IKITNEKSRA LSEIMSENPL RVNEDDSLVD ILTVMNENSV GFIPVVNSEE KLVGLITRSS
LLSILSDQFL DMEVNIFE