Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_1370 |
Symbol | |
ID | 4202971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | + |
Start bp | 1548207 |
End bp | 1549427 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 638082251 |
Product | sodium:dicarboxylate symporter family protein |
Protein accession | YP_695816 |
Protein GI | 110800194 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000579538 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATT TATCCTTAAT AAAAAGAATA TTTGTTGCAA TTATTTTAGG AATACTTATT GGGCTAGGAT GTTCCTATAT TAATTTAGAT ATACCTATTA GAATATTAAT GACCTTTAAT AGCATATTTG GGAATTTACT AAGTTTCTTA ATCCCACTTA TAATAGTTGG GTTTATAGTT CCTGGTATAG CATCCTTAGG AAATAAATCA GGAAAGGGAC TTTTCATAAC TACTTTAATT TCATATGCTT CAACATTTTT AATAGGAATA CTTACTTTCT TTATAGGACG CGCAGTACTT CCTAAATTTA TAGTAAGTGC TTCTCTAAGC ACTGGATCAG TAAATGTTGA TCCTTATTTT ACAATTGATA TTCCTCCAAT GTTTGGTGTT ATGTCAGCTT TAGTTTTTGC ATTTTTATTA GGAATAGGAA TATCAAGAAT AAAAAATAGT TACTTATTAA AAGTATCAGA AGAATTTAAT CACGTTATTT CATTAACTAT AAAAAATGTG TTAATACCTT TAGTACCTAT TTACATACTT TCAATATTTT CAAAGTTAAG TTATAATGGT GAGATTTTTA CTACTTTAAA GTCTTTTGGA CTTGTGTACT TAGTTTTATT TTCAATACAA GGATCTTATT TAGTGGTTCA ATATGCTTTA GCTGGAACTT TAAAGAAAGA AAATCCATTA AAATTACTTA AAAATATGAT TCCTGCATAT ATGACAGCTT TGGGAACTCA ATCATCAGCA GCTACAATCC CAGTTACTTT AAACTGTACT AAGGAAAATA AAGTTGATCA AGATGTAGCA GATTTTGTTA TTCCTTTAGG AGCAACAATA AATTTAGCAG GTGATACTAT TACTTTAGTT CTTGCATCAA TGGCTGTAAT GTATATGAAA GGACAAGTTC CAACTTTCTC TGTTATGGTT CCATTTATAG TTATGTTAGG AGTAACTATG GTAGCAGCAC CAGGGGTACC AGGTGGCGGA GTTATGGCTG CTTTAGGATT ACTTGAAGGT ATGCTTGGAT TTGGTAATGT TGAAAAATCC TTAATGATAG CACTTCATGC TGCTCAAGAT AGTTTTGGAA CAGCAACTAA TGTAACTGGA GATGGGGCTA TAGCTATAAT AGTAGAATCA ATCTTAAAGA AAAGAAATAA TACTAACATT AAAATTGAAG AGGCTGAAGA AGACTTTATT CCAAAGGTTA GTTGTAATTA A
|
Protein sequence | MKNLSLIKRI FVAIILGILI GLGCSYINLD IPIRILMTFN SIFGNLLSFL IPLIIVGFIV PGIASLGNKS GKGLFITTLI SYASTFLIGI LTFFIGRAVL PKFIVSASLS TGSVNVDPYF TIDIPPMFGV MSALVFAFLL GIGISRIKNS YLLKVSEEFN HVISLTIKNV LIPLVPIYIL SIFSKLSYNG EIFTTLKSFG LVYLVLFSIQ GSYLVVQYAL AGTLKKENPL KLLKNMIPAY MTALGTQSSA ATIPVTLNCT KENKVDQDVA DFVIPLGATI NLAGDTITLV LASMAVMYMK GQVPTFSVMV PFIVMLGVTM VAAPGVPGGG VMAALGLLEG MLGFGNVEKS LMIALHAAQD SFGTATNVTG DGAIAIIVES ILKKRNNTNI KIEEAEEDFI PKVSCN
|
| |