Gene CPF_0477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0477 
Symbol 
ID4201237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp566079 
End bp567206 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content21% 
IMG OID638081359 
Productglycosyl transferase, group 1 
Protein accessionYP_694932 
Protein GI110798592 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0293777 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATATT ATAAAGAAAG ATATAAAGTA ATAGTTATTA ATGATAAAAA GTTTGGTGAA 
AATACTAGTA ATCTTTGTAT AAAAAACTTT AAATATTATA CTGGAGAAAG TATTTTTAAA
TCAAATAATT ATAAAGAAAA GTATTTTAAA TTAAAAAGTC TTTTAAAAAA AAGTTTTTTA
ATTAAGTTAG CTAGAGGGTA TAAATTGAGT GAATTAAAAT TTTTAAAAGA TAACTCAGAT
ACAATTAATG AAATTATTAA TTATATAAGA GAAGAAAGAA TAACATTAGT TTATATAACG
GTACCAGATT TATATCCAAT ATATATTGCA AAGTGCATAA AGAAAGAGCT ATCAAATGTT
AAAATTATTA CAGAAGTAAG GGACATTTTA AATCATAATA TAGGAGGTGG AAATCCTAAG
TTTGTTTTAA AAAAAGCTGA AAAAATAATG TTAAATATAT CAGATGAGTT AATAGTATTA
TCAGAAGGTA TATATGATTA TTATAAAGAC AACTTTAATG AAACTAAAAT AAGTATAATT
AAAAATGGTT ATAATGAAAA ACTTTTTGAA AATTTAAATA AAATAAATTT AAAAAAAGAT
AAATTAACAT TGGCCCATAT TGGATCTATT TATAAAGGAA GAAACATAAA GAATTTTATA
TTAGCGTTAA ATAAATTTTC AATAGAAGAA TCAAAAAATA TAGTATTTAA TATAGTAGGT
TATCTAGATG ATATAGCTCA AAAAGATTTA GAGGAACTAG ATATAGATAA TTTAAATATA
GAAATTAATA TAATTGGTAC AGTTACTCAT GAAAAAGCTG TTGAATATTT AATAAATTCT
GATATAGCGG TTATTTTGAC ACATATAAAA GGGAGTGGGT ATGCTATACC AGGGAAAGTT
TTTGAATATA TTGGAGCTGA AAAACCAATA TTGGCTGTTA CTGAAGACTT GCCACTAATA
AATTTAATAA ATAGTAAATA TGGGGTGTGT TCAAAACATA ATATAAATAG TATAGTTCTT
TCAATGAAAA AATTACTGAA AATGAATTTT GATTTTGAAG ATAAAATTAA ATTTACAAGA
AAGAAACAAG CTGAGAAAAT ATTGAAAATA CTTGAAAAAT ATATTTAG
 
Protein sequence
MEYYKERYKV IVINDKKFGE NTSNLCIKNF KYYTGESIFK SNNYKEKYFK LKSLLKKSFL 
IKLARGYKLS ELKFLKDNSD TINEIINYIR EERITLVYIT VPDLYPIYIA KCIKKELSNV
KIITEVRDIL NHNIGGGNPK FVLKKAEKIM LNISDELIVL SEGIYDYYKD NFNETKISII
KNGYNEKLFE NLNKINLKKD KLTLAHIGSI YKGRNIKNFI LALNKFSIEE SKNIVFNIVG
YLDDIAQKDL EELDIDNLNI EINIIGTVTH EKAVEYLINS DIAVILTHIK GSGYAIPGKV
FEYIGAEKPI LAVTEDLPLI NLINSKYGVC SKHNINSIVL SMKKLLKMNF DFEDKIKFTR
KKQAEKILKI LEKYI