Gene CPR_1609 details

Gene Information

Locus tag: CPR_1609
Symbol:
ID: 4206186
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1799803
End bp: 1801578
Gene Length: 1776 bp
Protein Length: 591 aa
Translation table: 11
GC content: 38%
IMG OID: 642566160
Product: V-type ATP synthase subunit A
Protein accession: YP_698925
Protein GI: 110803741
COG category: [C] Energy production and conversion
COG ID: [COG1155] Archaeal/vacuolar-type H+-ATPase subunit A
TIGRFAM ID:
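
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1799803
end_bp = 1801578
gene_length_bp = 1776
protein_length_aa = 591

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is 3 bases per residue plus one 3-base stop codon.
expected_cds_bp = 3 * protein_length_aa + 3

print(span_bp == gene_length_bp)          # True
print(expected_cds_bp == gene_length_bp)  # True
```

Under these assumptions both checks agree with the stated gene length of 1776 bp.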


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.061044
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAAACGG GGAAAATTAT CAAGGTTTCA GGTCCTTTAG TAGTTGCTGA AGGTATGGAT 
GAAGCTAATG TATATGACGT TGTAAAAGTT GGAGAAAAAG GTCTTATCGG AGAGATCATT
GAAATGAGAG GAGATAAGGC TTCAATCCAG GTATATGAAG AAACATCAGG TATTGGACCT
GGGGACCCAG TTATAACTAC TGGAGAACCA CTTTCAGTAG AATTAGGACC AGGACTTATA
GAGTCAATGT TCGATGGAAT ACAAAGACCT CTAGACGCTT TCATGAAAGC AGCTAATTCT
GCTTTCTTAA GTAAAGGGGT AGAAGTTAAA TCTTTAAATA GAGAGAAAAA ATGGCCTTTT
GTGCCAACTG CTAAAGTTGG AGATAAGGTT TCAGCTGGAG ACGTAATAGG AACAGTTCAA
GAGACTGCCG TTGTTCTTCA TAGAATAATG GTTCCTTTCG GCGTTGAAGG TACAATAAAA
GAAATCAAAG CTGGAGATTT CAATGTAGAA GAAGTTATCG CTGTAGTAGA AACTGAAAAA
GGTGATAAGA ATTTAACATT AATGCAAAAA TGGCCTGTAA GAAAAGGTAG ACCATATGCA
AGAAAATTAA ATCCAGTTGA GCCAATGACT ACAGGACAAA GAGTTATAGA TACTTTCTTC
CCAGTAGCTA AAGGTGGAGC TGCTGCCGTT CCAGGACCTT TCGGAGCTGG TAAAACAGTA
GTTCAGCACC AAGTTGCTAA ATGGGGAGAT ACTGAGATAG TTGTTTATGT TGGATGTGGA
GAACGTGGTA ACGAAATGAC AGACGTTCTT AACGAATTCC CAGAACTTAA AGACCCTAAA
ACTGGGGAAA GCTTAATGAA GAGAACAGTT CTTATCGCTA ATACATCTAA CATGCCAGTT
GCTGCCAGAG AAGCATCAAT ATATACTGGT ATAACAATAG CAGAGTATTT CAGAGATATG
GGATACTCAG TATCAATCAT GGCTGACTCA ACTTCACGTT GGGCAGAGGC TTTAAGAGAA
ATGTCAGGAA GACTTGAAGA AATGCCAGGA GACGAAGGTT ACCCAGCATA CTTAGGATCA
AGACTTGCTG ATTACTATGA AAGAGCTGGT AAGGTTGTAG CTTTAGGTAA AGATGGAAGA
GAAGGAGCTG TTACAGCTAT CGGAGCAGTA TCCCCTCCAG GAGGAGATAT ATCTGAGCCA
GTTACACAAT CAACTTTAAG AATAGTTAAA GTTTTCTGGG GACTAGATGC TCAATTAGCA
TATAAGAGAC ACTTCCCATC AATTAACTGG TTAACATCAT ACTCATTATA CTTAGAAAAA
ATGGGTGAAT GGATGGATGC TCACGTAGCA GACGATTGGT CAGCATTAAG AACAGAAGCT
ATGGCACTTC TTCAAGAAGA AGCAAACTTA GAAGAAATAG TAAGACTTGT TGGTATGGAT
GCACTTTCAG AAGGTGATAG ATTAAAACTT GAAGTTGCTA AGTCAATAAG AGAAGACTAT
TTACAACAAA ACGCATTCCA TGAGAATGAC ACATATACTT CATTAAATAA ACAGTATAAA
ATGTTAAACT TAATCTTAAG TTTCAGACAT GAGGCTGAAA AAGCTTTAGA AGCTGGAGTT
TATTTAGATA AAGTATTAAA ACTTCCTGTT AGAGATAGAA TTGCAAGAAG TAAATATATT
TCAGAAGAAG AGATAAGTAA GATGGATGAC ATCTTAGTTG AATTAAAATC AGAGATGAAC
AAGTTAATCA GCGAGGGAGG TGTTCTAAAT GCTTAA
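
The GC content field in the record can in principle be recomputed from the gene sequence above. This is a rough sketch, assuming GC content is simply the fraction of G and C bases over the whole sequence; `gc_percent` is a hypothetical helper name, and the page may round or compute the value differently.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence: 4 of 10 bases are G or C.
print(gc_percent("TTGAAAACGG"))  # 40.0
```

Passing the full 1776-base sequence (with or without the spaces between 10-base blocks) would give the value to compare against the GC content field.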
 
Protein sequence
MKTGKIIKVS GPLVVAEGMD EANVYDVVKV GEKGLIGEII EMRGDKASIQ VYEETSGIGP 
GDPVITTGEP LSVELGPGLI ESMFDGIQRP LDAFMKAANS AFLSKGVEVK SLNREKKWPF
VPTAKVGDKV SAGDVIGTVQ ETAVVLHRIM VPFGVEGTIK EIKAGDFNVE EVIAVVETEK
GDKNLTLMQK WPVRKGRPYA RKLNPVEPMT TGQRVIDTFF PVAKGGAAAV PGPFGAGKTV
VQHQVAKWGD TEIVVYVGCG ERGNEMTDVL NEFPELKDPK TGESLMKRTV LIANTSNMPV
AAREASIYTG ITIAEYFRDM GYSVSIMADS TSRWAEALRE MSGRLEEMPG DEGYPAYLGS
RLADYYERAG KVVALGKDGR EGAVTAIGAV SPPGGDISEP VTQSTLRIVK VFWGLDAQLA
YKRHFPSINW LTSYSLYLEK MGEWMDAHVA DDWSALRTEA MALLQEEANL EEIVRLVGMD
ALSEGDRLKL EVAKSIREDY LQQNAFHEND TYTSLNKQYK MLNLILSFRH EAEKALEAGV
YLDKVLKLPV RDRIARSKYI SEEEISKMDD ILVELKSEMN KLISEGGVLN A
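
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates this, assuming the standard codon assignments (which table 11 shares) and treating only ATG, GTG and TTG as start codons read as M in the first position; that start set is a simplification, and `translate_cds` is a hypothetical helper rather than the tool used to produce this record.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a simplified set of bacterial start codons, all read as M.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still encode Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate_cds("TTGAAAACGG GGAAAATTAT CAAGGTTTCA"))  # MKTGKIIKVS
```

The output matches the first ten residues of the protein sequence, and the leading TTG is why the gene does not begin with ATG even though the protein begins with M.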