Gene CPF_1466 details

Gene Information

Locus tag: CPF_1466
Symbol:
ID: 4203283
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1651004
End bp: 1652455
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 32%
IMG OID: 638082344
Product: putative lipoprotein
Protein accession: YP_695909
Protein GI: 110801108
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
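
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record, but both are consistent with the values listed. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1651004, 1652455)
print(length, protein_length(length))
```

With the values from this record, the sketch gives 1452 bp and 483 aa, matching the Gene Length and Protein Length fields.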


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.123804
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTAAAA ACAAAAAACT GATCGCTTTA TTATTAGGAG GAATAATTGG AACATCTTCT 
ATCTTAGCTG GATGTGGCTC AGGGGGGAGT GCTTCATCTA GTGATGGAGA AGGAGAAAAA
CCAGTAAACT TAGTATGGTA TGTAATCGGT AAACCACAAA ATGATGGAGA ATTAGTTGAG
GAAGAGGTAA ATAAGTATAT AAAGGATAAA ATAAATGCTA CTGTAGACAT AAAACATATT
GACTTTGGTG ATTATAGTCA AAAAATGAAC GTAATAGCTA ACTCAGGAGA AGAATATGAT
TTAGCATTTA CATGTTCATG GGCTTTCCCA TACTTAGATA ATGCAAGAAA GGGAGCTTTC
TTAGAGTTAA ATGATCTTAT AGATAGTCAT GGAAAAGATC TTAAGAATGT TATTGATGAA
AGACTTTGGA AGGGTGCAGA AGTTGATGGA AACATATATG CAGTGCCAAA CCAAAAGGAA
ATAGCAGGAG CACCTATGTG GGTATTTGAT AAGGAACTTG TTGAGAAATA TGATATTCCT
TACCAAGATA TTCATTCCGT TGATGATTTA GAACCATGGC TTGCACTTAT AAAAGAAAAG
GAACCAGATT TTGTTCCATT CTATACTCAA GGAGATGGAA TTCCTATAGA AGGTATAGAG
GATATAACAT CAGGCTTAGG TATTTTCTAT GATGATAAAA GCTTAACAGT TAAAAATATG
TATGAAACAG AGGAGCTAAA ACATCTTTTC ACTAAATTAA GAGAATTCTA TGAAAAAGGA
TATATAAATC AAGATGCAGC AGTTAGTAAT ATGAAAAATG AAGTTAAGAG ATTTGTGTGG
AAGGCTGATG GACAACCATA TGCTGAAAAT GGATGGAGTC AATCTTTAGG TAGAGAGGTT
GTAACTTCAT CAATAGTTTC TTCATATGTT ACAAATGCAT CAACTACAGG TGCTATGACT
GCTATATCAG CAACATCTAA GCATCCAGAA AAGGCTATGG AACTTATAAA CTTAGTAAAT
AAAGATTCTA CATTAAGAAA TCTATTAATG TTTGGAATAG AGGGAACTCA CTATGAAAAG
GTTAGTGATA ATCAAATAAA GAGAGATCCA AATGGACCAT ATAGTGTTAC AAGTTGGGCT
TATGGAAACT TATTTGATAC TTATGTTTTA GATAGTGACC CAGTAGATAA GTGGGATGCT
TTTGAGGAAT TTAACCAAAA GGCTAAAACT TCAACTATAT TAGGATTTAA ATTTGATACA
GAAAAAGTTG TAACTCAAAT GTCAGCTGTA AGTAATGCTT TTGAAGAGTT TATTAAACCT
TTATATACTG GTTCAGTAGA TACTGAAGAG ACTTTAGAAA AGTTAAATAA GAAGCTATAT
GATTCAGGTC TAGAAGATAT AAAAGTTGAG TTACAAAGAC AATTAGATGA GTGGAAAAAA
GAAAATAAAT AG
 
Protein sequence
MLKNKKLIAL LLGGIIGTSS ILAGCGSGGS ASSSDGEGEK PVNLVWYVIG KPQNDGELVE 
EEVNKYIKDK INATVDIKHI DFGDYSQKMN VIANSGEEYD LAFTCSWAFP YLDNARKGAF
LELNDLIDSH GKDLKNVIDE RLWKGAEVDG NIYAVPNQKE IAGAPMWVFD KELVEKYDIP
YQDIHSVDDL EPWLALIKEK EPDFVPFYTQ GDGIPIEGIE DITSGLGIFY DDKSLTVKNM
YETEELKHLF TKLREFYEKG YINQDAAVSN MKNEVKRFVW KADGQPYAEN GWSQSLGREV
VTSSIVSSYV TNASTTGAMT AISATSKHPE KAMELINLVN KDSTLRNLLM FGIEGTHYEK
VSDNQIKRDP NGPYSVTSWA YGNLFDTYVL DSDPVDKWDA FEEFNQKAKT STILGFKFDT
EKVVTQMSAV SNAFEEFIKP LYTGSVDTEE TLEKLNKKLY DSGLEDIKVE LQRQLDEWKK
ENK
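
The gene and protein sequences above are printed in space-separated blocks of ten. The sketch below shows one way to strip that layout, translate the gene sequence, and compute a GC percentage. It is an illustration, not the method the database used. It assumes the amino-acid assignments of NCBI translation table 11 (the table named in the record), it does not handle alternative start codons or ambiguous bases, and the helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered T, C, A, G
# at each position; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


first_line = "ATGCTTAAAA ACAAAAAACT GATCGCTTTA TTATTAGGAG GAATAATTGG AACATCTTCT"
print(translate(first_line))
```

Translating the first line of the gene sequence this way yields MLKNKKLIALLLGGIIGTSS, which matches the first two blocks of the protein sequence.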