Gene CPF_1202 details


Gene Information

Locus tag: CPF_1202
Symbol:
ID: 4203155
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1367043
End bp: 1368245
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 42%
IMG OID: 638082083
Product: triple helix repeat-containing collagen
Protein accession: YP_695648
Protein GI: 110799319
COG category:
COG ID:
TIGRFAM ID:
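
The coordinate and length fields above are linked by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon. The function names are illustrative only and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_length = span_length(1367043, 1368245)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1203 400
```

Both values agree with the Gene Length (1203 bp) and Protein Length (400 aa) fields of this record.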


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000183423
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAAGAA AAATTTATAA TCCTAATAGA TATTATGATG ATTACAATAG ATATAATTGT 
TACGATAGAT ATAATTGCTA TGATGATGAG TATTGTCAAG ATGATTATTA TTGCAAGGAA
GACTGTTATT GTAAAGATGA TTGCTATTTA GAGATAAATT GCAATTGTTG CGATTGTTGT
AAACCTGGAC CAAGGGGTCC AAGAGGACCT CAAGGTCCTA GGGGTCCTCA AGGACCAAGA
GGTCCTATGG GATGTCAAGG TGAGCGTGGT CCAATAGGTC CTATGGGCCC TATGGGGCCT
ATTGGACCTC AAGGTCCACA AGGTGATCAA GGTCTTACTG GCCCTCAAGG CCCTGCTGGT
CCTCAAGGCG AACAAGGTCC ACAAGGTGAT CAAGGTCCTG TTGGTCCTAT AGGCCCTCAA
GGTCCTCAGG GTGAGCAAGG TCTTACTGGT CCTCAAGGAC CTGCTGGTTC TCAAGGCCCT
GAAGGTCCTA CTGGTCCTCA AGGTGCTACT GGCCCTCAAG GTCCTGAAGG TCCTACTGGT
GCTCAAGGAG ATCAAGGTCC TGTTGGTCCT CAAGGAGCTC AAGGTCCACA AGGCCCTCAA
GGTCCTCAAG GTGCTACCGG TCCTACTGGC CCACAGGGTC CTCAAGGTAA TCAAGGTCCT
GCTGGTCCTC AAGGCCCTGT TGGTCCTCAA GGTCCTCAAG GTGAACCTGG AGTGGATTTT
GATGATACCT TATTAGTTAG TTATTCCTCA TTAACTTCTC AAAATGTTAA TGCTAATGGT
ATATTCACTT ATAATATCCA AAATCCTAAT GGCTCAACTT TTACAGCAAT AACTGCCAAT
ATAGCAAACG GAACATTTAC AATAAATGAA CCTGGAAGAT ATTTATTTAT GTGGTCATTT
AATTTAGATA ACACAAATAA TACCACAGCT AGCGCTATAG TATCTTTATT TAGAAATGGT
TCTAGAGTAT TTTTATCTGG AACTCCTAGA GTAGCTCCTG GTGAAATAGG CGTAGTAAAT
GGAAGTATTG CCGTAAATGC TAATGCTGGT GATGTATTTG CTTTAGTTAA TAATTCTACA
AGAAACGTTT TATCACAAAT AATATCTTCA CCAATTTCTG TAACTCCAGC TATCTTAGGA
GAATCTACAG GAATAAATTC AGGAATAGGA TCTTGGGTTC AAATAGTTAG AGTATCTGAT
TAA
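
The GC content field of this record can in principle be recomputed from the gene sequence. The sketch below is one way to do that, assuming the sequence is pasted in the grouped ten-base layout used above. The helper name gc_percent is hypothetical. The example call uses only the first line of the gene sequence, so its result is not expected to match the value reported for the whole gene.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence only (60 bases), used as sample input.
first_line = "ATGATAAGAA AAATTTATAA TCCTAATAGA TATTATGATG ATTACAATAG ATATAATTGT"
print(f"{gc_percent(first_line):.1f}% GC in the first 60 bases")
```

To check the reported figure, pass the complete gene sequence to the function instead of the sample line.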
 
Protein sequence
MIRKIYNPNR YYDDYNRYNC YDRYNCYDDE YCQDDYYCKE DCYCKDDCYL EINCNCCDCC 
KPGPRGPRGP QGPRGPQGPR GPMGCQGERG PIGPMGPMGP IGPQGPQGDQ GLTGPQGPAG
PQGEQGPQGD QGPVGPIGPQ GPQGEQGLTG PQGPAGSQGP EGPTGPQGAT GPQGPEGPTG
AQGDQGPVGP QGAQGPQGPQ GPQGATGPTG PQGPQGNQGP AGPQGPVGPQ GPQGEPGVDF
DDTLLVSYSS LTSQNVNANG IFTYNIQNPN GSTFTAITAN IANGTFTINE PGRYLFMWSF
NLDNTNNTTA SAIVSLFRNG SRVFLSGTPR VAPGEIGVVN GSIAVNANAG DVFALVNNST
RNVLSQIISS PISVTPAILG ESTGINSGIG SWVQIVRVSD
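
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The sketch below illustrates this on the first line of the gene sequence. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. The names CODON_TABLE and translate are illustrative, and the sketch does not handle ambiguous bases or alternative start codons.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (the layout NCBI uses);
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence (60 bases, 20 codons).
first_line = "ATGATAAGAA AAATTTATAA TCCTAATAGA TATTATGATG ATTACAATAG ATATAATTGT"
print(translate(first_line))  # MIRKIYNPNRYYDDYNRYNC
```

The output matches the first 20 residues of the protein sequence listed above.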