Gene CPF_1370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1370 
Symbol 
ID4202971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1548207 
End bp1549427 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content30% 
IMG OID638082251 
Productsodium:dicarboxylate symporter family protein 
Protein accessionYP_695816 
Protein GI110800194 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000579538 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATT TATCCTTAAT AAAAAGAATA TTTGTTGCAA TTATTTTAGG AATACTTATT 
GGGCTAGGAT GTTCCTATAT TAATTTAGAT ATACCTATTA GAATATTAAT GACCTTTAAT
AGCATATTTG GGAATTTACT AAGTTTCTTA ATCCCACTTA TAATAGTTGG GTTTATAGTT
CCTGGTATAG CATCCTTAGG AAATAAATCA GGAAAGGGAC TTTTCATAAC TACTTTAATT
TCATATGCTT CAACATTTTT AATAGGAATA CTTACTTTCT TTATAGGACG CGCAGTACTT
CCTAAATTTA TAGTAAGTGC TTCTCTAAGC ACTGGATCAG TAAATGTTGA TCCTTATTTT
ACAATTGATA TTCCTCCAAT GTTTGGTGTT ATGTCAGCTT TAGTTTTTGC ATTTTTATTA
GGAATAGGAA TATCAAGAAT AAAAAATAGT TACTTATTAA AAGTATCAGA AGAATTTAAT
CACGTTATTT CATTAACTAT AAAAAATGTG TTAATACCTT TAGTACCTAT TTACATACTT
TCAATATTTT CAAAGTTAAG TTATAATGGT GAGATTTTTA CTACTTTAAA GTCTTTTGGA
CTTGTGTACT TAGTTTTATT TTCAATACAA GGATCTTATT TAGTGGTTCA ATATGCTTTA
GCTGGAACTT TAAAGAAAGA AAATCCATTA AAATTACTTA AAAATATGAT TCCTGCATAT
ATGACAGCTT TGGGAACTCA ATCATCAGCA GCTACAATCC CAGTTACTTT AAACTGTACT
AAGGAAAATA AAGTTGATCA AGATGTAGCA GATTTTGTTA TTCCTTTAGG AGCAACAATA
AATTTAGCAG GTGATACTAT TACTTTAGTT CTTGCATCAA TGGCTGTAAT GTATATGAAA
GGACAAGTTC CAACTTTCTC TGTTATGGTT CCATTTATAG TTATGTTAGG AGTAACTATG
GTAGCAGCAC CAGGGGTACC AGGTGGCGGA GTTATGGCTG CTTTAGGATT ACTTGAAGGT
ATGCTTGGAT TTGGTAATGT TGAAAAATCC TTAATGATAG CACTTCATGC TGCTCAAGAT
AGTTTTGGAA CAGCAACTAA TGTAACTGGA GATGGGGCTA TAGCTATAAT AGTAGAATCA
ATCTTAAAGA AAAGAAATAA TACTAACATT AAAATTGAAG AGGCTGAAGA AGACTTTATT
CCAAAGGTTA GTTGTAATTA A
 
Protein sequence
MKNLSLIKRI FVAIILGILI GLGCSYINLD IPIRILMTFN SIFGNLLSFL IPLIIVGFIV 
PGIASLGNKS GKGLFITTLI SYASTFLIGI LTFFIGRAVL PKFIVSASLS TGSVNVDPYF
TIDIPPMFGV MSALVFAFLL GIGISRIKNS YLLKVSEEFN HVISLTIKNV LIPLVPIYIL
SIFSKLSYNG EIFTTLKSFG LVYLVLFSIQ GSYLVVQYAL AGTLKKENPL KLLKNMIPAY
MTALGTQSSA ATIPVTLNCT KENKVDQDVA DFVIPLGATI NLAGDTITLV LASMAVMYMK
GQVPTFSVMV PFIVMLGVTM VAAPGVPGGG VMAALGLLEG MLGFGNVEKS LMIALHAAQD
SFGTATNVTG DGAIAIIVES ILKKRNNTNI KIEEAEEDFI PKVSCN