Gene CPR_1168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1168 
SymbolcitN 
ID4206188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1313000 
End bp1314340 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content31% 
IMG OID642565724 
Productcitrate/sodium symporter 
Protein accessionYP_698490 
Protein GI110803743 
COG category[C] Energy production and conversion 
COG ID[COG3493] Na+/citrate symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.226143 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTAC AAAATAAAAA GGTAGATGGT AAAGAAGAAA GTTTGCTTAA AAAGTTTTTT 
AATATAAATC TTTTTGGAGT GCCAATGTTA TTGTTTTTGG TTGGAGCAAT AATCATAATA
TTAGGTATAT CAACAAATTC ATTACCAAAG GATATGGTTG GAAGTATATT CTTAATATTT
ACTTGTGGTA TTGTTTTAGG TAAAATAGGT GATTCAATAC CTATCTGGAA AGATTATTTA
GGTGGAGGTG CTATTTTAGC ATTCTTAGTA ACTTCATATG CAGTTTATAT AGGATTAATT
CCTACCACTT ATGTAAAGGA TATCAAAGAA CTTTTTGATA GTGGATTCTT AGAGTTATAT
ATTTCAATAA TGATTTGTGG TTCTTTATTA GCAATAGATA GAAAATTCTT AGCAAAAACT
GTTGGTGGAT TTATACCTAT GGTAATTATT GCAACTTTAA CTGCTGCCTT AGGCGCTGTA
ATAGGAGGAT TAATTACAGG AGTTTCTCCA AAAGAAGTTA TATTAAATTA TGCACTTCCT
ATTATGGGTG GTGGAAATGG AGCAGGTGCT ATTCCTATGA GCCAAATTTG GGGACAAGTA
ACAGGAAAAG ATCCAAAGAT ATGGTATTCA TCAGCAATGG CAATATTAAC AATTGCAAAC
ATAATAGCAA TATTAGCAGG TGCTATTTTA AATGGTATAG GTAATAAAAA ACCTAAGCTT
ACAGGGAATG GAGAATTGAT TAAAGGAGTA AAAAATGCAG AATCTACAAA ATCTAATTTT
AAAGCAACCT TTGAAGATGG CGCAGCAGGT CTATTTATAA CACTTGCTTT CTACTGTTTA
GGAAATCTGT TTGGAAAAGG ATTCTTCCCA ACAATAGGTG GAGTTTTCAT TCATCCATTT
GCTTACATGG TTGTATTTGT ATTAATAGCT AGTGGATTTA ACTTAGTTCC AGAAAGAATT
AGAGTTGGTG CAAAACAAGT TCAAAAATTT ATGGTTGGTA ATTTATTCTA CGTTTTAATA
GCAGGAGTTG GTATAGCAAT GGTTGATTTT GGAGCTTTAT TAAAGGCATT CAATTTAACA
ACTGTTATAA TATCATTAGG TGCAGTTATA GGAGCTATAT TAGGTCCTTG GATAACTTCA
AAAATATTTG GTTTCTATCC TATAGAAGCT TCCATAGCAG CAGGATTATG CCATGTTAAT
AGAGGAGGTT CTGGAGACTT AGAAATATTA GGAGCTGCTA AGAGAATGAA TTTAATGGCA
TATGCTCAAA TAGCAACTAG ACTTGGTGGA GCAATTATAT TAGTTTTAGC AGGATTCCTA
TTTAGTTTAT GGCTAAAATA A
 
Protein sequence
MKLQNKKVDG KEESLLKKFF NINLFGVPML LFLVGAIIII LGISTNSLPK DMVGSIFLIF 
TCGIVLGKIG DSIPIWKDYL GGGAILAFLV TSYAVYIGLI PTTYVKDIKE LFDSGFLELY
ISIMICGSLL AIDRKFLAKT VGGFIPMVII ATLTAALGAV IGGLITGVSP KEVILNYALP
IMGGGNGAGA IPMSQIWGQV TGKDPKIWYS SAMAILTIAN IIAILAGAIL NGIGNKKPKL
TGNGELIKGV KNAESTKSNF KATFEDGAAG LFITLAFYCL GNLFGKGFFP TIGGVFIHPF
AYMVVFVLIA SGFNLVPERI RVGAKQVQKF MVGNLFYVLI AGVGIAMVDF GALLKAFNLT
TVIISLGAVI GAILGPWITS KIFGFYPIEA SIAAGLCHVN RGGSGDLEIL GAAKRMNLMA
YAQIATRLGG AIILVLAGFL FSLWLK