Gene Information |
Locus tag | CPR_2258 |
Symbol | |
ID | 4204732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 2475228 |
End bp | 2476967 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 642566810 |
Product | solute-binding family 5 protein |
Protein accession | YP_699534 |
Protein GI | 110802287 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
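The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 2475228, End bp 2476967.
length = gene_length_bp(2475228, 2476967)
print(length, protein_length_aa(length))  # 1740 579
```

Both results agree with the Gene Length (1740 bp) and Protein Length (579 aa) rows.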
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAA AATTAGTTGC ATTATTAACA GTAGGATTAG CAGCTTCAAT GTTATTTGTA GCATGTGGTG GAGGAGCTAA CAATACAGCT CAAGGAAATG GTAATGGTTC AGAATCAGGA GGAACTACTA AGGATTTATC AAAGCCAGAA AGAATAGAGG CATCAAATCC TAGTGCACTT CCAGATGCTG CTAAGAATAG AACTGATACT TTAATAGTAG GAACTACAGA TCCAAAGGGT GAATTTGTTC CAATATATTC TTCTACTCTT TATGATTCAT GGGTTAACAA GTTAGTATTT GATGGATTGA TTACTAATAA TGAAAAAGGT GAACCAATTC CAAATGTAGC AGAAAGTTAT GAAGTTTCTG AGGATGGAAA AACTTATACA TTTAAATTAA ATAAGGGTAT TAAATTTACT AATGGTCAAG AATTAACAGC AAAAGATGTT GCATTTACAT TTACTTCTAT TTGTGATCCA GGATATGATG GACCAAGAAT GGATGCTGTA AATAATTTAG TTGGATATGA AGAGTACAAT AAGGGCGATG CTAGTAGTGT TGAAGGTATA AAGGTTATTG ATGATTATAC AATATCATTC ACTAACAAGA ATACTGATGC AGCTGGTATA TGGAATTTTG AATATGGAAT TATGCCTGAA AGTGTTTATA AATTTGAAAA AGGAAACTTC CAAGCTGTTA AGGATAAATT ATTAGAGCCA GTAGGTTCAG GTGCTTATAA ATTTGTTCAC TTTAAACCAG GACAAGAAGT TAAGTTTGAA AAAAATGCTG ATTACTGGAA AGGGGAGCCG AAGATTCCTT ATATAGTAAT GAAAGTTACA AATGCACAAA CATTATTACA AGAATTAATG GCTGGAACAG TTGATATAGA TAGAGTTGGT GCTAAACCAG AAAATATAGA TCCATTAAAA CAAGCTGGAT TCTTAAACTT AGATCTTTAT ATGCAAAATG GTTATGGATA CATGGGGCTT AACTATGGAA GTGATAAGGT TAAAGACCCT AAAGTAAGAC AAGCGTTACT TTATGGATTA AATAGAGAAG GATTCATGCA ATCTTATTAC CAAGGATATG GTCAAGTTTA CAACTCACAC ATTCTTCCTA CTTCATGGGC ATATAACCCA GATGTTCCTA AGTATGAATA CAATCCAGAA AAAGCTAAAG AATTACTTGA TGAAGCAGGC TGGAAAGATA CAAATGGAAA TGGAGTTAGA GATAAGGATG GAGTTGAATT AGAACTTCAA TGGTTAACTT ATACTGGTTC TAAATATGTT GATGCTTTAA TCCCAATAGT TCAACAATCT TGGGAACAAA TAGGTGTTAA AGTTACTCCA GAACTTATGG AATTTGGAAC AATGATGGAT AAAGTTAATA ACAGAGAATA TGATATATTC AATGGTGCTT GGAACCTTTC AATAGATCCA GACCCATCAG GAATATTTGC AATTTCTCAA GATGTACCAG GCGGATTTAA TAATATTGGA TGGAGAAATG AAGAAGCAGA TAAGTTATTA AAAGAAGGTA AAGGAACAAC AAATCAAGAG GAAAGAAAGA AAGCTTATGC TGAATGGCAA TTAAAATTCT CTGAAGATGT ACCTTATATT CTTCTTGGAA ATGCACAAGA AATGTTTGCA TCAAATTCAA GAGTTAAAGG ATATAACCCT TCAACTTATA TAGATTGGAC TCACGATGTT TATAAACTTG AATTAGATAA CAATAAATAA
|
Protein sequence | MKRKLVALLT VGLAASMLFV ACGGGANNTA QGNGNGSESG GTTKDLSKPE RIEASNPSAL PDAAKNRTDT LIVGTTDPKG EFVPIYSSTL YDSWVNKLVF DGLITNNEKG EPIPNVAESY EVSEDGKTYT FKLNKGIKFT NGQELTAKDV AFTFTSICDP GYDGPRMDAV NNLVGYEEYN KGDASSVEGI KVIDDYTISF TNKNTDAAGI WNFEYGIMPE SVYKFEKGNF QAVKDKLLEP VGSGAYKFVH FKPGQEVKFE KNADYWKGEP KIPYIVMKVT NAQTLLQELM AGTVDIDRVG AKPENIDPLK QAGFLNLDLY MQNGYGYMGL NYGSDKVKDP KVRQALLYGL NREGFMQSYY QGYGQVYNSH ILPTSWAYNP DVPKYEYNPE KAKELLDEAG WKDTNGNGVR DKDGVELELQ WLTYTGSKYV DALIPIVQQS WEQIGVKVTP ELMEFGTMMD KVNNREYDIF NGAWNLSIDP DPSGIFAISQ DVPGGFNNIG WRNEEADKLL KEGKGTTNQE ERKKAYAEWQ LKFSEDVPYI LLGNAQEMFA SNSRVKGYNP STYIDWTHDV YKLELDNNK
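The sequences above are shown in space-separated blocks of ten characters, and the gene sequence already reads in coding orientation (ATG through TAA), so it can be translated directly without reverse-complementing. The following is a minimal sketch of how such a record might be cleaned and checked; the helper names are hypothetical. It uses the standard amino-acid assignments, which translation table 11 shares (table 11 differs in the start codons it permits, which this sketch does not model).

```python
# Amino acids in NCBI "TCAG" codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the block spacing (any whitespace) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence in this record.
print(translate("ATGAAAAGAA AATTAGTTGC ATTATTAACA"))  # MKRKLVALLT
print(translate("TATAAACTTG AATTAGATAA CAATAAATAA"))  # YKLELDNNK
```

The two fragments translate to the first ten and last nine residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content row.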