Gene CPR_0931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0931 
Symbol 
ID4204324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1064992 
End bp1066617 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content21% 
IMG OID642565489 
Productsolute-binding family 3 protein 
Protein accessionYP_698255 
Protein GI110801475 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain
[COG2200] FOG: EAL domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000314501 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA ATAAAAGATT TATAATTCTT ATATTATTTT TTTTAATATT CATAATACTA 
TTTTTGCTTC ATGGAGTCAT CATAAATAAA GAAAAGAGAG AGCAAGTAGT TAAAATTGGA
TTTTACGACG ATTATCCTCA TTTTTATATA GATCATAGAG GAAATGTTTG TGGATATTAT
AAAGACATAA CTGAAAAATT AGCTGAAAAA CTTAATTTTA AAATTGAATA TGTAAACGGA
AAAGTTCCAG AACTTTTAAA AGAGCTTAAA AATGGAGAAA TAGATTTGGT TTTTGGAATA
AATAAATTGC CAGAGAGGGA GGAACTCTTT GACTTCACAA ATAATCCTAT AAATAATGAG
ATGAAGCTTA TATATACTAA TAATGACATA AAATATGGTG ATTTAGAAGG CTTAAATGGT
ATGAAAATGG GGTATATAGA AGGTGAGTTA GATAATGAAT GGATATTAGA TTATTTAAGC
AAAAGAAATA TAAATGTTGA ATTAGTTAAT GGATCTTCTT ATAAAGCAAT AAAAACATTG
TTAGTTGAAA ATAAAGTAGA CTTTATTGTA GATAATCCAG ATAGTGATAT AAAAAATAAG
GGTAAAAATA TTAAAGAGAT CTTTCAATTT TCATCTGGAC CAAAATACAT TGTAGCAAAT
AAAAATAATA ATGAGTTAAT AAAAAAAATT GATAAATCAC TTACTACAAT TAATCTTAAT
TCATATCTTG ATAATGACGA TTATATAAAA AAAACTCATA ATTTTATTAT AGCCATTGTC
AATAAAAATA TTATTCATTT AATTGTCTTT ATTTTAATTA TAATTTTATT TAATAGAGGT
AAAAAAGGAA TAGCTAAAAT ATTAAATAAG AAAAAAATAT ATAATGATTT AAAAAAAGAC
AACTATATTT TATATTATCA ACCAATAGTT GATTTTAGGA ATAACACAAT AAGAAGTGTT
GAGGCCTTAT TAAGATTAAA AAAAGATGGT CAATTATTAA CTCCCAATTA TTTTATGAAT
ATTATAGAAG AAGCTAATAT GATGAAAGAA ATCACATTAT GGGTATTAAA TAAGGTAATT
AAAGATTATA ATATTATAAG TTGTTACAAT AATATAAATG AAGAAAATTT TTATATTTCT
ATAAATGTAT CTTTTAATGA GATAAAAGAT ATAGAGTTTT TAAAGAAAAT AGTCAAAATA
GTCAATGATA ATAAAATAAT AAAAAATAGT ATTTGTCTAG AAATTATAGA AAAATTTGGG
GTAGAAGAGG TAGAAAAAAT ACAAAAAAAC ATTAAGTTTT TACGAGAAAA TGGAATTCTA
ATTGCAATAG ATGATTTTGG TGTGGAGTAT TCAAACTTAG ATTTATTAAA AAAAATAGAT
TCTAATATCA TTAAATTAGA TAAGTTTTTT GCAGATGGAA TTAATGATTC AGAAATAAGT
ATTAAAGTAA TAGGTTTTAT ATTAGATATA TGCAGATTAT CAAATAAATC TATAGTTATA
GAAGGAATAG AGAAAAAAGA ACAAGTAGAT ATAATAAAAA CATTTCTTTA TGAAAAAATT
TATATTCAAG GATACTATTT TTCAAAACCA TTAGATATTA ATAGTTTAAA AGATTATACT
TTTTAA
 
Protein sequence
MKKNKRFIIL ILFFLIFIIL FLLHGVIINK EKREQVVKIG FYDDYPHFYI DHRGNVCGYY 
KDITEKLAEK LNFKIEYVNG KVPELLKELK NGEIDLVFGI NKLPEREELF DFTNNPINNE
MKLIYTNNDI KYGDLEGLNG MKMGYIEGEL DNEWILDYLS KRNINVELVN GSSYKAIKTL
LVENKVDFIV DNPDSDIKNK GKNIKEIFQF SSGPKYIVAN KNNNELIKKI DKSLTTINLN
SYLDNDDYIK KTHNFIIAIV NKNIIHLIVF ILIIILFNRG KKGIAKILNK KKIYNDLKKD
NYILYYQPIV DFRNNTIRSV EALLRLKKDG QLLTPNYFMN IIEEANMMKE ITLWVLNKVI
KDYNIISCYN NINEENFYIS INVSFNEIKD IEFLKKIVKI VNDNKIIKNS ICLEIIEKFG
VEEVEKIQKN IKFLRENGIL IAIDDFGVEY SNLDLLKKID SNIIKLDKFF ADGINDSEIS
IKVIGFILDI CRLSNKSIVI EGIEKKEQVD IIKTFLYEKI YIQGYYFSKP LDINSLKDYT
F