Gene CPF_2819 details


Gene Information

Locus tag: CPF_2819
Symbol:
ID: 4201940
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 3080537
End bp: 3081763
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 35%
IMG OID: 638083686
Product: NupC family nucleoside transporter
Protein accession: YP_697183
Protein GI: 110801028
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1972] Nucleoside permease
TIGRFAM ID: [TIGR00804] nucleoside transporter
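
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3080537
end_bp = 3081763
reported_gene_length = 1227      # bp
reported_protein_length = 408    # aa

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not encode a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, gene_length == reported_gene_length)
print(protein_length, protein_length == reported_protein_length)
```

Under these assumptions both comparisons agree with the reported values: 1227 bp gives 409 codons, of which 408 encode residues.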


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGATAGGT TTATTGGTGT AATCGGTCTT ATTTGTATTA TTGGTATAGC TGTTCTTTTT 
TCTGAAAACA GAAAGAAGAT CAACTGGAGA TTGGTTGGAA CAGGTCTTTT ATTACAAATT
ATTTTCGCTT TATTAATCCT AAAAGTTCCT GCCGGTAGAG CAGTATTTGA ATGGATTAGT
AGCGGAATAA CTAAGTTATT AGATTTTACT AAAGAAGGTA GTTCATTCTT ATTTGGATCA
TTACTTGATA CAGACAAATT CGGTGTAATA TTTGCTCTAC AAGTATTACC AACTATTATC
TTCTTCTCAT CATTAATGAG TGTACTTTAT CATTTAGGTA TAGTTCAAGT AGTAGTTAAA
GTTATTGCTA AGGGTGTTGC TAAAGTATTA GGAACAAGTG GTGCTGAAAC TTTCAGTGCA
GTTGGTAATA TCTTTTTAGG TCAAACAGAG GCTCCTCTAC TAGTTAAACC ATACATAAAG
AACATGACTA GATCAGAAAT ATGTGCAATC ATGATAGGTG GTATGGCTAC TGTTGCCGGT
GGTGTTATGG CTGGTTATGT AGCTATGGGT GTTAACGCTG GTAACTTATT AGCAGCATCA
ATCATGGCAG CCCCTGCTGG ATTAATATTA GCTAAAATAT TAGTTCCTGA AACTGAAGTT
CCTGAAACTA AAGGTGGCGC AACTTTAGAA CTTAAAGTTG AAAGTGAAAA TGTTATTGAA
GCTGCTGCAA ACGGTGCTTC AGAAGGTTTA GGATTAGCTT TAAACGTTGG TGCTATGCTT
CTTGCATTCG TTGCTCTTAT AGCTATGATC AATGCTTTAT TTGGAGCAAT TGGTGGAATA
TTTGGTGCAC CTTGGTTAAG CTTAAACTGG ATTCTTGGTA GATTATTCTC TCCATTAGCA
TTTATAATGG GAGTTCCAAC TAAAGACGTT TTCGCAGCTG GAGACTTACT AGGAATTAAA
TTAGCAGTTA ATGAATTCTT AGCTTACTCA CAATTATCAA ACTACATAGC AAGCGGAACT
TTAGAACCTA AGACTATAAT GATATTAACT TATGCTCTTT GTGGATTCGC TAACTTAAGT
TCAGTTGCTA TACAATTAGG TGGTATCGGT GGATTAGCTC CAGAAAAGAA ACCAACTATA
GCTAAGTTAG GATTCAAAGC ACTTTTAGGT GGTGTATTAG CTACTTGTAT GACAGCTACT
ATAGCAGGTA TCTTATTTAG TGCTTAA
 
Protein sequence
MDRFIGVIGL ICIIGIAVLF SENRKKINWR LVGTGLLLQI IFALLILKVP AGRAVFEWIS 
SGITKLLDFT KEGSSFLFGS LLDTDKFGVI FALQVLPTII FFSSLMSVLY HLGIVQVVVK
VIAKGVAKVL GTSGAETFSA VGNIFLGQTE APLLVKPYIK NMTRSEICAI MIGGMATVAG
GVMAGYVAMG VNAGNLLAAS IMAAPAGLIL AKILVPETEV PETKGGATLE LKVESENVIE
AAANGASEGL GLALNVGAML LAFVALIAMI NALFGAIGGI FGAPWLSLNW ILGRLFSPLA
FIMGVPTKDV FAAGDLLGIK LAVNEFLAYS QLSNYIASGT LEPKTIMILT YALCGFANLS
SVAIQLGGIG GLAPEKKPTI AKLGFKALLG GVLATCMTAT IAGILFSA
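
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. The gene begins with GTG, which translation table 11 allows as a start codon and which is read as methionine (M) when it initiates translation. The code below is a minimal sketch, not the tool used to produce this record: the helper names `translate_cds` and `gc_content` are hypothetical, the codon table holds the standard amino-acid assignments (which table 11 shares), and the function assumes its input starts at the initiator codon. Only the first row of the gene sequence is used.

```python
# Standard amino-acid assignments, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the input starts at the initiator codon, so the first
    codon is always emitted as M (this covers alternative starts such as GTG).
    """
    dna = "".join(dna.split()).upper()          # drop the spacing between 10-base groups
    usable = len(dna) - len(dna) % 3            # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        residue = CODON_TABLE[dna[index:index + 3]]
        if index == 0:
            residue = "M"
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna):
    """Return the percentage of G and C bases in a sequence."""
    dna = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence above (60 bases).
first_row = "GTGGATAGGT TTATTGGTGT AATCGGTCTT ATTTGTATTA TTGGTATAGC TGTTCTTTTT"
prefix = translate_cds(first_row)
print(prefix)
print(round(gc_content(first_row), 1))
```

The translated prefix is MDRFIGVIGLICIIGIAVLF, which matches the first 20 residues of the protein sequence. The GC figure printed here covers only the first 60 bases, so it need not equal the 35% reported for the whole gene.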