Gene CPF_2004 details


Gene Information

Locus tag: CPF_2004
Symbol:
ID: 4202988
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2242630
End bp: 2244027
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 31%
IMG OID: 638082873
Product: xanthine/uracil permease family protein
Protein accession: YP_696437
Protein GI: 110799866
COG category: [R] General function prediction only
COG ID: [COG2252] Permeases
TIGRFAM ID:
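
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2242630, 2244027)
print(length, protein_length(length))  # prints "1398 465"
```

Both values agree with the Gene Length and Protein Length fields of the record.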


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.461655
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAATA AAATTCATGC TTTAAGGGAA GAAGGAAATT TACGTGTTTT ACCTGAGAAT 
AAAAGTGAAT ATAAGAGAGA ATTCTTAGCT GGAACAACAA GTTTCTTAGC AATGGCTTAT
ATAATAGCTG TAAATCCATC TATATTAAGC GCAGCAGGAA TGCCAGCAGG TGCTATAGTA
ACGGCAACAT GTATATCAGC AGTTATCGGA TGTTTAATAA TGGGGTTTTA TGCTAAATTA
CCTTTTGGAC TAGCTCCTGG AATGGGACTA AATGCATTCT TTACTTTTTC AGTAGTTATA
GGAATGGGAA TTTCTTGGGA AGTAGCTCTT ACAGCTGTTT TTGTAGAAGG AATAATATTT
ATACTTTTAT CATTATTTAA AGTAAGAGAG GCTGTTGTTG ATGCAATTCC AATAAATTTA
AAATATGCAG TTACAGCAGG GATAGGTCTT TTCATAGCTT TCATAGGATT TAATGGAGCT
GGAGTTGTTA TTGGAAATCC AGATACAATG GTTGCTATGG GACAAGTTGG TCCTAAAATG
TTAATAGCAA TGGTTGGACT TTGTATAATA GTAATTTTAG AAAAGAAAAA AGTTAAAGGT
TCAATGCTAG TTGGTATAGT AGTTTCAACT CTTTTAGCTT GGGGATATGC TTTAATAAAT
ACTGAAGCCG CAGCTAGTAT GGGAATCTAT TTACCAAATG GAATATTTAA ATTTGAATCA
ATAGCTCCAA TAGCAGGTAA AGTTAATTTT TCATATTTAA CTTCACCACA GCATGTATTT
AATTTTATAA CTATAGTTTT CACATTTTTA TTTGTTGATT TTTTTGATAC AGTAGGAACT
TTAATAGGAG TAGCTTCAAG AGCTAATATG TTAGATAAGA AGGGAAGAGT TCCTAATGCA
GGTAAAGCAT TAATGACAGA TGCTATAGCA ACTACGGCAG GGGCTTTACT TGGAACATCT
ACTGTAACAG TTTATGTTGA AAGTGCTACT GGAGTTGAAG AAGGTGGTAG AACAGGATTA
ACAGCTATAA CAATAGGAGC TTTATTTTTC GTAGCAATGT TTTTCTCACC AATATTTGTA
GCAGTACCAG CATGTGCTAC TGCACCAGCT TTAATATATG TTGGATATTT AATGCTAACT
AGTGTGTTAA AAATAGATTT TAGTGATATT ACAGATGCAG TACCAGCATT TTTAATAATA
GCTTTAATGC CTTTAACTTA TAGCATAGGT GATGGATTAA CAATTGGAGT TTTAGCATAT
GTAATATTAA ATATATTACA CAATATCTTT ACTAAAAATA AAAAAGATAA AAAAGAATTA
TCAATGGTAA TGATAGTTTT AGCGATTATA TTTGTAATAA AACTTTGTCT ACCATTAATT
ACACAGATGA TAGGTTAA
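
The gene sequence is printed in blocks of 10 bases, 60 per line, so the spacing has to be stripped before the sequence is measured. A minimal sketch of that cleanup and of a GC fraction calculation follows. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed fraction describes that line alone and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used here only as sample input.
first_line = "ATGGAAAATA AAATTCATGC TTTAAGGGAA GAAGGAAATT TACGTGTTTT ACCTGAGAAT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full 1398 bp sequence through the same two functions gives a value that can be compared with the GC content field of the record.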
 
Protein sequence
MENKIHALRE EGNLRVLPEN KSEYKREFLA GTTSFLAMAY IIAVNPSILS AAGMPAGAIV 
TATCISAVIG CLIMGFYAKL PFGLAPGMGL NAFFTFSVVI GMGISWEVAL TAVFVEGIIF
ILLSLFKVRE AVVDAIPINL KYAVTAGIGL FIAFIGFNGA GVVIGNPDTM VAMGQVGPKM
LIAMVGLCII VILEKKKVKG SMLVGIVVST LLAWGYALIN TEAAASMGIY LPNGIFKFES
IAPIAGKVNF SYLTSPQHVF NFITIVFTFL FVDFFDTVGT LIGVASRANM LDKKGRVPNA
GKALMTDAIA TTAGALLGTS TVTVYVESAT GVEEGGRTGL TAITIGALFF VAMFFSPIFV
AVPACATAPA LIYVGYLMLT SVLKIDFSDI TDAVPAFLII ALMPLTYSIG DGLTIGVLAY
VILNILHNIF TKNKKDKKEL SMVMIVLAII FVIKLCLPLI TQMIG
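
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions, not the pipeline that produced this record: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to sense codons as the standard code, and it ignores the alternative start codons of table 11, which do not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGAAAATA AAATTCATGC TTTAAGGGAA"))  # prints "MENKIHALRE"
```

The output matches the first ten residues of the protein sequence, and translating the final line of the gene sequence (`ACACAGATGA TAGGTTAA`) gives `TQMIG`, the last five residues.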