Gene CPR_1036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1036 
Symbol 
ID4204398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1180093 
End bp1181256 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content30% 
IMG OID642565593 
Productsodium:dicarboxylate symporter family protein 
Protein accessionYP_698359 
Protein GI110801588 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000829053 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGAAAT TATTTAATAA TCTTATTTTC AAATTAATTT TAGGTGTTAT ATTAGGAATA 
ATAATAGGCA CATACTCTTC AGAGGGGCTT ATGTCAACAA TTGTGACAAT TAAGTATGTA
TTGGGACAAA TTATATTTTT CTCTGTTCCA CTTATTATTT TAGGGTTTAT AGCGCCATCT
ATTGCTAAGT TAAAAGATAA TGCAAGCAAA TTATTAGGAT ATGCTGTTTT AATAGCTTAT
TTATCTTCAG TTTTTGCTGC TATTCTTTCA ATGATTGCAG GATATGCATT AATACCTAAA
TTATCTATAG TATCTAATAT AGCATCATTA AAGGAATTAC CAGAACTTAT ATTTAAATTA
GATATACCAC CAGTTATGAG TGTAATGAGT GCATTAGCTT TAGCATTACT TTTAGGATTA
GCTGTTGGAT GGACAAAGGC TGATTTAGTA GAAAAGCTTT TAGATCAATT TCAAGCTATA
GTACTTAGTA TTGTAAATAA AATAATAATA CCAATATTAC CATTTTTCAT AGCAACTAAC
TTTGCAGCTT TAGCATATGA AGGAGGATTA AGTAATCAAC TTCCTGTATT CTTTAAAGTT
ATATTAATTG TATTATTTGG TCATTTTATA TGGTTAACAA TTTTATATTT AATAGGTGGA
GCAATATCAA AAGAAAATCC ATGGGAAGTT GTAAAATACT ATGGACCAGC ATATCTTACT
GCAGTTGGTA CAATGTCAAG TGCAGCAACA TTACCAGTAG CTTTAGAGTC TGCAAAGAAA
TCAAAGGCTT TAAGAGAAGA TATAGTTGAT TTTGCAATAC CATTATGTTC AAATATACAT
TTATGTGGTT CAGTTCTTAC AGAGGTATTT TTTGTAATGA CAGTATCTCA AATTTTATAT
GGTAAGATTC CTAGTTTACC AACTATGATA TTGTTTATAG TATTATTAGG AGTGTTTGCA
ATAGGGGCAC CAGGAGTCCC AGGGGGAACT GTAATGGCAT CATTAGGTTT AATAATTAGT
GTATTAGCCT TTGATGAGGC TGGTACAGCT CTTATGTTAA CAATATTTGC TCTTCAAGAT
AGTTTTGGAA CAGCATGTAA TGTAACTGGT GATGGAGCAA TAGCTCTTAT GCTGACAGGT
ATAGCAAAGA AAAAGAATTT ATAA
 
Protein sequence
MKKLFNNLIF KLILGVILGI IIGTYSSEGL MSTIVTIKYV LGQIIFFSVP LIILGFIAPS 
IAKLKDNASK LLGYAVLIAY LSSVFAAILS MIAGYALIPK LSIVSNIASL KELPELIFKL
DIPPVMSVMS ALALALLLGL AVGWTKADLV EKLLDQFQAI VLSIVNKIII PILPFFIATN
FAALAYEGGL SNQLPVFFKV ILIVLFGHFI WLTILYLIGG AISKENPWEV VKYYGPAYLT
AVGTMSSAAT LPVALESAKK SKALREDIVD FAIPLCSNIH LCGSVLTEVF FVMTVSQILY
GKIPSLPTMI LFIVLLGVFA IGAPGVPGGT VMASLGLIIS VLAFDEAGTA LMLTIFALQD
SFGTACNVTG DGAIALMLTG IAKKKNL