Gene CPR_1019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1019 
Symbol 
ID4206584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1161048 
End bp1162355 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content30% 
IMG OID642565576 
Productsodium:dicarboxylate symporter family protein 
Protein accessionYP_698342 
Protein GI110801814 
COG category[R] General function prediction only 
COG ID[COG1823] Predicted Na+/dicarboxylate symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00254139 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAATA TTAAATTAAT ATTAGCAATT GTGCTTTCAA TATTAGCAAC ATATGCTATA 
TATAAAATAA GAAAAATTAC AAATAAATTT TCATTTGCTA CGCTGACAGC ATTAACTTTA
GGGGTAGTAC TTGGAATTAT ATTTAAGGAG AATATACTAT TTTTAGATAC AGTAGGAAAG
GCTTATATGT CTTTAATTAA GATGATAGTA GTTCCTTTAG TAGTAACATC CTTAATAACT
AGTATTGTTA GATTAGAAAA TTTAGATACA TTAAAATCAA TAGGATTAAA AACATTTACT
GTTTTATTAG GAACTACAGG AGCTGCAGCC TTTATAGGAA TTATTGTAGC TAGTTCTTTA
AATCTTGGAC AAGGTTTAAG ATTTATAGGG GCTGAAAATT TTAAGGCAAG AGAAATACCA
GGGTTTTCTA AGGTACTTAT AGATATGCTA CCATCAAATC CTTTAGCGGC TATTGTAGAG
AATAAAATAA TACCAATAGT TATTTTTTCA ATGTTTATAG CAATTGCCTT AGTTATTGAA
GATAATACTA ATAAAGAAAA AGCAAAGCCA TTTAAAGATT TTATTTTATC AGCTTATGAT
ATAGTTTTAA GAATAACTAA GATGGTATTA AGAATAATAC CATATGGAGT ATTTGCCTTA
ATAGCTACAG CGGCAGCTAA AAATGGAATG GATACTTTGA TGTCATTAAT ATGGGTAATA
CTAGCTGTTT ATATAGCTGC CTTTCTTCAA TTTTTATTTG TATATACTCC ATTAATAAGC
TTTGTTGCAA GAATGAATCC ATTAAAATTC TTTAAAGGAA TTTTTCCGGC ACAGGTTGTA
GCTTTTACAA GTCAAAGTAG TTATGGTACT TTACCTGTTA CAATAAAATC TTTAGTAGAG
GGTGTTGGAG TATCAGAAAA TATAGCAAGC TTTGTAGCAC CACTTGGATC AACAATTGGA
CTAAATGGAT GTGGAGGTTT TTATCCAGCA ATAGTTGCAA TATTTGCAGC CAATGTTTTT
AATGTAGAAC TTACTATTTA TTCATACATA CTTATAGTTT TAACTGCTAT AATATCTTCC
ATAGGAATAG CAGGGGTACC TGGATCAGCA ACAATGTCAA CAACTGTAAT GTTAGCGGCT
TTAGGATTAC CAATAGAAGC ATTAGCAATG GTGATTGCAG TAGATTCTAT AATTGATATG
ATAAGAACTG CCACAAATGT AACAGGGGCT TCAGTTGCTG CATTAATAGT TGATCAAACA
GAAAAAAGAA AAGAATATAA AGTTGAAGAA TCAGTACAAA GAGCATAA
 
Protein sequence
MINIKLILAI VLSILATYAI YKIRKITNKF SFATLTALTL GVVLGIIFKE NILFLDTVGK 
AYMSLIKMIV VPLVVTSLIT SIVRLENLDT LKSIGLKTFT VLLGTTGAAA FIGIIVASSL
NLGQGLRFIG AENFKAREIP GFSKVLIDML PSNPLAAIVE NKIIPIVIFS MFIAIALVIE
DNTNKEKAKP FKDFILSAYD IVLRITKMVL RIIPYGVFAL IATAAAKNGM DTLMSLIWVI
LAVYIAAFLQ FLFVYTPLIS FVARMNPLKF FKGIFPAQVV AFTSQSSYGT LPVTIKSLVE
GVGVSENIAS FVAPLGSTIG LNGCGGFYPA IVAIFAANVF NVELTIYSYI LIVLTAIISS
IGIAGVPGSA TMSTTVMLAA LGLPIEALAM VIAVDSIIDM IRTATNVTGA SVAALIVDQT
EKRKEYKVEE SVQRA