Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1036 |
Symbol | |
ID | 4204398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 1180093 |
End bp | 1181256 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 642565593 |
Product | sodium:dicarboxylate symporter family protein |
Protein accession | YP_698359 |
Protein GI | 110801588 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000829053 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGAAAT TATTTAATAA TCTTATTTTC AAATTAATTT TAGGTGTTAT ATTAGGAATA ATAATAGGCA CATACTCTTC AGAGGGGCTT ATGTCAACAA TTGTGACAAT TAAGTATGTA TTGGGACAAA TTATATTTTT CTCTGTTCCA CTTATTATTT TAGGGTTTAT AGCGCCATCT ATTGCTAAGT TAAAAGATAA TGCAAGCAAA TTATTAGGAT ATGCTGTTTT AATAGCTTAT TTATCTTCAG TTTTTGCTGC TATTCTTTCA ATGATTGCAG GATATGCATT AATACCTAAA TTATCTATAG TATCTAATAT AGCATCATTA AAGGAATTAC CAGAACTTAT ATTTAAATTA GATATACCAC CAGTTATGAG TGTAATGAGT GCATTAGCTT TAGCATTACT TTTAGGATTA GCTGTTGGAT GGACAAAGGC TGATTTAGTA GAAAAGCTTT TAGATCAATT TCAAGCTATA GTACTTAGTA TTGTAAATAA AATAATAATA CCAATATTAC CATTTTTCAT AGCAACTAAC TTTGCAGCTT TAGCATATGA AGGAGGATTA AGTAATCAAC TTCCTGTATT CTTTAAAGTT ATATTAATTG TATTATTTGG TCATTTTATA TGGTTAACAA TTTTATATTT AATAGGTGGA GCAATATCAA AAGAAAATCC ATGGGAAGTT GTAAAATACT ATGGACCAGC ATATCTTACT GCAGTTGGTA CAATGTCAAG TGCAGCAACA TTACCAGTAG CTTTAGAGTC TGCAAAGAAA TCAAAGGCTT TAAGAGAAGA TATAGTTGAT TTTGCAATAC CATTATGTTC AAATATACAT TTATGTGGTT CAGTTCTTAC AGAGGTATTT TTTGTAATGA CAGTATCTCA AATTTTATAT GGTAAGATTC CTAGTTTACC AACTATGATA TTGTTTATAG TATTATTAGG AGTGTTTGCA ATAGGGGCAC CAGGAGTCCC AGGGGGAACT GTAATGGCAT CATTAGGTTT AATAATTAGT GTATTAGCCT TTGATGAGGC TGGTACAGCT CTTATGTTAA CAATATTTGC TCTTCAAGAT AGTTTTGGAA CAGCATGTAA TGTAACTGGT GATGGAGCAA TAGCTCTTAT GCTGACAGGT ATAGCAAAGA AAAAGAATTT ATAA
|
Protein sequence | MKKLFNNLIF KLILGVILGI IIGTYSSEGL MSTIVTIKYV LGQIIFFSVP LIILGFIAPS IAKLKDNASK LLGYAVLIAY LSSVFAAILS MIAGYALIPK LSIVSNIASL KELPELIFKL DIPPVMSVMS ALALALLLGL AVGWTKADLV EKLLDQFQAI VLSIVNKIII PILPFFIATN FAALAYEGGL SNQLPVFFKV ILIVLFGHFI WLTILYLIGG AISKENPWEV VKYYGPAYLT AVGTMSSAAT LPVALESAKK SKALREDIVD FAIPLCSNIH LCGSVLTEVF FVMTVSQILY GKIPSLPTMI LFIVLLGVFA IGAPGVPGGT VMASLGLIIS VLAFDEAGTA LMLTIFALQD SFGTACNVTG DGAIALMLTG IAKKKNL
|
| |