Gene CPF_1223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1223 
Symbol 
ID4203555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1391813 
End bp1392976 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content31% 
IMG OID638082104 
Productsodium:dicarboxylate symporter family protein 
Protein accessionYP_695669 
Protein GI110800158 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00163296 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGAAAT TATTTAATAA TCTTATTTTT AAACTTGTTT TAGGTGTTAT ATTAGGAATA 
ATAATAGGCA CATACTCTTC AGAGGGGCTT ATGTCAACAA TTGTGACAAT TAAGTATGTA
TTGGGACAAA TTATATTTTT CTCTGTTCCA CTTATTATTT TAGGATTTAT AGCTCCATCT
ATTGCTAAAT TAAAAGATAA CGCAAGCAAA CTATTAGGAT ATGCTGTTTT AATAGCTTAT
TTATCTTCAG TTTTTGCTGC TATTCTTTCA ATGATTGCAG GATATGCATT AATACCTAAA
TTATCTATAG TATCTAATAT AGCATCATTA AAGGAATTAC CAGAACTTAT ATTTAAATTA
GATATACCAC CAGTTATGAG TGTAATGAGT GCGTTAGCTT TAGCATTACT TTTAGGATTA
GCTGTTGGAT GGACAAAGGC TGATTTAGTA GAAAAGCTTT TAGATCAATT TCAAGCTATA
GTACTTAGTA TTGTAAATAA AATAATAATA CCAATATTAC CATTTTTCAT AGCAACTAAC
TTTGCAGCTT TAGCTTATGA AGGAGGATTA AGTAATCAAC TTCCTGTATT CTTTAAAGTT
ATATTAATTG TATTATTTGG TCATTTTATA TGGTTAACAA TTTTATATTT AATAGGTGGA
GCAATATCAA AAGAAAATCC ATGGGAAGTT GTAAAGTATT ATGGACCAGC ATATCTTACT
GCAGTAGGTA CAATGTCAAG TGCAGCAACA TTACCAGTAG CTTTAGAGTC TGCAAAGAAA
TCAAAGGCTT TAAGAGAAGA TATAGTGGAT TTTGCAATAC CATTATGTTC AAACATACAT
TTATGTGGTT CAGTTCTTAC AGAAGTATTC TTTGTAATGA CAGTATCTCA AATTTTATAT
GGTAAGATTC CGAGTTTACC AACTATGATA TTATTTATAG TATTATTAGG GGTGTTTGCA
ATCGGGGCAC CAGGAGTCCC TGGGGGGACA GTAATGGCAT CATTAGGTTT AATAATTAGT
GTATTAGCCT TTGATGAGGC TGGGACAGCT CTTATGTTAA CAATATTTGC TCTTCAAGAT
AGTTTTGGAA CAGCATGTAA TGTAACTGGT GATGGAGCAA TAGCTCTTAT GCTGACAGGT
ATAGCAAAGA AAAAGAACTT ATAA
 
Protein sequence
MKKLFNNLIF KLVLGVILGI IIGTYSSEGL MSTIVTIKYV LGQIIFFSVP LIILGFIAPS 
IAKLKDNASK LLGYAVLIAY LSSVFAAILS MIAGYALIPK LSIVSNIASL KELPELIFKL
DIPPVMSVMS ALALALLLGL AVGWTKADLV EKLLDQFQAI VLSIVNKIII PILPFFIATN
FAALAYEGGL SNQLPVFFKV ILIVLFGHFI WLTILYLIGG AISKENPWEV VKYYGPAYLT
AVGTMSSAAT LPVALESAKK SKALREDIVD FAIPLCSNIH LCGSVLTEVF FVMTVSQILY
GKIPSLPTMI LFIVLLGVFA IGAPGVPGGT VMASLGLIIS VLAFDEAGTA LMLTIFALQD
SFGTACNVTG DGAIALMLTG IAKKKNL