Gene CPR_1054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1054 
Symbol 
ID4204607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1200884 
End bp1202392 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content30% 
IMG OID642565610 
ProductC4-dicarboxylate anaerobic carrier family protein 
Protein accessionYP_698376 
Protein GI110802357 
COG category[S] Function unknown 
COG ID[COG1288] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.544295 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA AGAAAAAAAT TTCATTCCCT ACAGCCTTTA CTGTATTATT TATTGTTTTA 
ATTTTATCAG CTATATTAAC TTATGTTATT CCAGCAGGAT CATATTCAAA ATTGTCTTAT
AATGAAGCTG AAAACACCTT TGTTGTTACA AATCCTCAAG GGGAAAGCAC TAAGGAAAAC
GCAACTCAAA ATACCTTAGA TAAACTTGGT ATAAAAATAA ACTTAAGTAA ATTTACTGAT
GGAAGTATAA ATAAACCAAT AGCTATACCA AATACTTATG AAAAGGTTTC TCAAAATCCT
CAAGGAATTT CTAAAATAAT AGAAGCTCCC ATTCAAGGAA CTTATGACAC TATAGATATA
ATTATGTTCG TTCTAATAAT AGGTGGAGTA ATTGGAGTTT TAAATGCTAC TGGAGCATTT
AATGCCGGAA TTGCTAGCCT TTCTAAAATA ACTAAAGGAA AAGAATATAT ACTTATAATA
TTATTATCAA TACTTATTTC TCTTGGTGGT ACTACTTTTG GATTGGCAGA AGAAACAATT
GCTCTTTATC CTATTTTACT CCCAATATTC CTAGCTTCTG GCTATGATGC TATAGTATGT
ATTGCTACAA TATATATGGG TTCATCTATA GGAACAATGT TCTCAACTGT AAACCCATTC
TCTTCAGTAA TAGCTTCAAC AGCCGCTGGA ATAAGCTTTA AAGAAGGCCT TGATTTTAGG
ATGATAGGAT TAGTTTTAGC TACACTTATA ACAATAATTT ATATACTTAG ATATGCTAAA
AAAGTTAAGA ATGATCCTTC TAAATCCCTT GTATATGATC AAAAAGATGA AATAGATTCT
AAATTTCTTC ATGAATCTAA TAATGATGTG CCAGTATTTA CTTGGAGACT TAAACTTATG
CTTTTAATAT TCGCTGGTTC ATTTGTAATT TTAGTTTATG GAGTTTCAGC TAAAGGATGG
GGATTTATAC AAATGACTGC TCTATTCCTT GTAGTTGGAA TAATTTTAGG TTTCCTTTCA
GGACTTGGAG AAAAGAAATT TGTTAATACA TTTATAGCTG GTGCTGCTGA TTTGGTAGGA
GTTGCCTTAG TTATAGGTGT TGCAAGATCT ATAAACTTAA TACTTGAAAA TGGTAAAATA
TCAGATACTT TACTTTATGT ATCCTCAAAT GGAATTCAAG GTATGGATAA AAATATATTT
ATAATATTAA TGCTTGTTAT ATTCATAATC TTAGGATTCT TTATTCCATC TTCATCTGGT
CTTGCTGTTT TATCAATTCC AATAATGGCA CCACTTGCAG ATACAGTTGG TTTACCAAGG
GATGTTATAG TTAGTGCTTA CCAATTTGGT CAAGGATTAA TCTCCTTTAT AACTCCAACA
GGATTAATTT TAGCTACCCT TGCTATGGTT GATGTAACCT ATAATAAATG GCTGAAATTT
ATTATGCCTT TAATGGGAAT TATAGCAGCC TTTGCAGCCT TACTATTATT AGTACAAGTA
CACTTTTAA
 
Protein sequence
MSKKKKISFP TAFTVLFIVL ILSAILTYVI PAGSYSKLSY NEAENTFVVT NPQGESTKEN 
ATQNTLDKLG IKINLSKFTD GSINKPIAIP NTYEKVSQNP QGISKIIEAP IQGTYDTIDI
IMFVLIIGGV IGVLNATGAF NAGIASLSKI TKGKEYILII LLSILISLGG TTFGLAEETI
ALYPILLPIF LASGYDAIVC IATIYMGSSI GTMFSTVNPF SSVIASTAAG ISFKEGLDFR
MIGLVLATLI TIIYILRYAK KVKNDPSKSL VYDQKDEIDS KFLHESNNDV PVFTWRLKLM
LLIFAGSFVI LVYGVSAKGW GFIQMTALFL VVGIILGFLS GLGEKKFVNT FIAGAADLVG
VALVIGVARS INLILENGKI SDTLLYVSSN GIQGMDKNIF IILMLVIFII LGFFIPSSSG
LAVLSIPIMA PLADTVGLPR DVIVSAYQFG QGLISFITPT GLILATLAMV DVTYNKWLKF
IMPLMGIIAA FAALLLLVQV HF