Gene CPR_2258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2258 
Symbol 
ID4204732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2475228 
End bp2476967 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content32% 
IMG OID642566810 
Productsolute-binding family 5 protein 
Protein accessionYP_699534 
Protein GI110802287 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAA AATTAGTTGC ATTATTAACA GTAGGATTAG CAGCTTCAAT GTTATTTGTA 
GCATGTGGTG GAGGAGCTAA CAATACAGCT CAAGGAAATG GTAATGGTTC AGAATCAGGA
GGAACTACTA AGGATTTATC AAAGCCAGAA AGAATAGAGG CATCAAATCC TAGTGCACTT
CCAGATGCTG CTAAGAATAG AACTGATACT TTAATAGTAG GAACTACAGA TCCAAAGGGT
GAATTTGTTC CAATATATTC TTCTACTCTT TATGATTCAT GGGTTAACAA GTTAGTATTT
GATGGATTGA TTACTAATAA TGAAAAAGGT GAACCAATTC CAAATGTAGC AGAAAGTTAT
GAAGTTTCTG AGGATGGAAA AACTTATACA TTTAAATTAA ATAAGGGTAT TAAATTTACT
AATGGTCAAG AATTAACAGC AAAAGATGTT GCATTTACAT TTACTTCTAT TTGTGATCCA
GGATATGATG GACCAAGAAT GGATGCTGTA AATAATTTAG TTGGATATGA AGAGTACAAT
AAGGGCGATG CTAGTAGTGT TGAAGGTATA AAGGTTATTG ATGATTATAC AATATCATTC
ACTAACAAGA ATACTGATGC AGCTGGTATA TGGAATTTTG AATATGGAAT TATGCCTGAA
AGTGTTTATA AATTTGAAAA AGGAAACTTC CAAGCTGTTA AGGATAAATT ATTAGAGCCA
GTAGGTTCAG GTGCTTATAA ATTTGTTCAC TTTAAACCAG GACAAGAAGT TAAGTTTGAA
AAAAATGCTG ATTACTGGAA AGGGGAGCCG AAGATTCCTT ATATAGTAAT GAAAGTTACA
AATGCACAAA CATTATTACA AGAATTAATG GCTGGAACAG TTGATATAGA TAGAGTTGGT
GCTAAACCAG AAAATATAGA TCCATTAAAA CAAGCTGGAT TCTTAAACTT AGATCTTTAT
ATGCAAAATG GTTATGGATA CATGGGGCTT AACTATGGAA GTGATAAGGT TAAAGACCCT
AAAGTAAGAC AAGCGTTACT TTATGGATTA AATAGAGAAG GATTCATGCA ATCTTATTAC
CAAGGATATG GTCAAGTTTA CAACTCACAC ATTCTTCCTA CTTCATGGGC ATATAACCCA
GATGTTCCTA AGTATGAATA CAATCCAGAA AAAGCTAAAG AATTACTTGA TGAAGCAGGC
TGGAAAGATA CAAATGGAAA TGGAGTTAGA GATAAGGATG GAGTTGAATT AGAACTTCAA
TGGTTAACTT ATACTGGTTC TAAATATGTT GATGCTTTAA TCCCAATAGT TCAACAATCT
TGGGAACAAA TAGGTGTTAA AGTTACTCCA GAACTTATGG AATTTGGAAC AATGATGGAT
AAAGTTAATA ACAGAGAATA TGATATATTC AATGGTGCTT GGAACCTTTC AATAGATCCA
GACCCATCAG GAATATTTGC AATTTCTCAA GATGTACCAG GCGGATTTAA TAATATTGGA
TGGAGAAATG AAGAAGCAGA TAAGTTATTA AAAGAAGGTA AAGGAACAAC AAATCAAGAG
GAAAGAAAGA AAGCTTATGC TGAATGGCAA TTAAAATTCT CTGAAGATGT ACCTTATATT
CTTCTTGGAA ATGCACAAGA AATGTTTGCA TCAAATTCAA GAGTTAAAGG ATATAACCCT
TCAACTTATA TAGATTGGAC TCACGATGTT TATAAACTTG AATTAGATAA CAATAAATAA
 
Protein sequence
MKRKLVALLT VGLAASMLFV ACGGGANNTA QGNGNGSESG GTTKDLSKPE RIEASNPSAL 
PDAAKNRTDT LIVGTTDPKG EFVPIYSSTL YDSWVNKLVF DGLITNNEKG EPIPNVAESY
EVSEDGKTYT FKLNKGIKFT NGQELTAKDV AFTFTSICDP GYDGPRMDAV NNLVGYEEYN
KGDASSVEGI KVIDDYTISF TNKNTDAAGI WNFEYGIMPE SVYKFEKGNF QAVKDKLLEP
VGSGAYKFVH FKPGQEVKFE KNADYWKGEP KIPYIVMKVT NAQTLLQELM AGTVDIDRVG
AKPENIDPLK QAGFLNLDLY MQNGYGYMGL NYGSDKVKDP KVRQALLYGL NREGFMQSYY
QGYGQVYNSH ILPTSWAYNP DVPKYEYNPE KAKELLDEAG WKDTNGNGVR DKDGVELELQ
WLTYTGSKYV DALIPIVQQS WEQIGVKVTP ELMEFGTMMD KVNNREYDIF NGAWNLSIDP
DPSGIFAISQ DVPGGFNNIG WRNEEADKLL KEGKGTTNQE ERKKAYAEWQ LKFSEDVPYI
LLGNAQEMFA SNSRVKGYNP STYIDWTHDV YKLELDNNK