Gene CPF_1667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1667 
Symbol 
ID4202172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1884013 
End bp1885158 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content30% 
IMG OID638082542 
Productaminotransferase, class V 
Protein accessionYP_696106 
Protein GI110798941 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGTTT ATTTTGATAA TAGTGCTACT ACTAAGCCTT TAAAGGAAGT TAGAGATGAA 
GTTTATTATG CTATGGATGA ATTCTGGGGT AATCCATCAT CTTTACATAA ATTAGGGGTA
AAGATGCAAA GAAAGATTGA AGAGCTTCAA GAAAGTATTG CAAAAAAAAT AAATGCTTCT
AAGGAAGAAA TAATCTTTAC TTCAGGAGGA AGTGAAAGCA ATAATATGAT TATTAAAGGA
GTAGTTGGAG AAAATAATCA TATTATAACT ACAACCTTTG AACATTCTAG TGTTTTAAAT
ACTTATAGGG AATTAGAAAA GCAAGGTGTA AGTGTAACAT ATTTAAAGGT TAATAATAAA
GGTTTTATAG ATTTAAAAGA ATTAGAAGAG GCAATAAATA AAAATACGGT TTTAGTATCT
ATAATGCAGG TTAACAATGA AGTGGGAAGC GTACAAAAGA TTAAGGAAAT AGGAAGATTA
ATTAAAGAAA AAAGTAAAAG AGCAAAATTT CATGTAGATG GAGTACAGGG TTTTGGAAAA
TTTGAAATTG ATGTTAAGGC ATGTAATATA GATTTTTATT CTGTTTCAGC TCATAAGTTT
CATGGCCCAA AGGGAGTTGG ATTCATGTAT ATGAGAAAGG GATTAAATTT AAAATCCTTA
ATAACTGGTG GAGAACAACA AAGAGGACTA AGGGCAGGAA CGGAAAATAC TCCTTCGTAT
ATGGGCATGG TAAAGGCTAT GGATATTGCC TATGATTCCT TAGAAGATTC TTATAATCAT
GTAAAAAATC TTAAGGAGTA TTTTATAGAA AAACTTTCTA AAATAGAAAA TGTAGTAATA
AATAGCCCTA GTAGTGAAGA ATATAGTCCT TACATATTAA ATGTTTCTTT TTTAGGAATT
AGATCAGAGG TTTTACTTCA CATTTTAGAG GAGGATAACA TATTTGTTTC AACAGGGTCA
GCCTGTTCTT CAAAAGCTTC TATATCAAAG GGAAGTTACG TATTAAATGC TATGGGATTA
GAACCAAAGT GCATTCAAGG GGCTATAAGA TTTAGCTTTT CTAGATATAA CAATTTAGAA
GAAGTTGATT ACACCATAGC TTCACTAGAA AAAGCTTTAA AATTTTTAAG GAGAATAAAA
ATATGA
 
Protein sequence
MEVYFDNSAT TKPLKEVRDE VYYAMDEFWG NPSSLHKLGV KMQRKIEELQ ESIAKKINAS 
KEEIIFTSGG SESNNMIIKG VVGENNHIIT TTFEHSSVLN TYRELEKQGV SVTYLKVNNK
GFIDLKELEE AINKNTVLVS IMQVNNEVGS VQKIKEIGRL IKEKSKRAKF HVDGVQGFGK
FEIDVKACNI DFYSVSAHKF HGPKGVGFMY MRKGLNLKSL ITGGEQQRGL RAGTENTPSY
MGMVKAMDIA YDSLEDSYNH VKNLKEYFIE KLSKIENVVI NSPSSEEYSP YILNVSFLGI
RSEVLLHILE EDNIFVSTGS ACSSKASISK GSYVLNAMGL EPKCIQGAIR FSFSRYNNLE
EVDYTIASLE KALKFLRRIK I