Gene CPF_1890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1890 
Symbol 
ID4201875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2125752 
End bp2127527 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content38% 
IMG OID638082759 
ProductV-type ATP synthase subunit A 
Protein accessionYP_696323 
Protein GI110798759 
COG category[C] Energy production and conversion 
COG ID[COG1155] Archaeal/vacuolar-type H+-ATPase subunit A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAACGG GGAAAATTAT CAAGGTTTCA GGTCCTTTAG TAGTTGCTGA AGGTATGGAT 
GAAGCTAATG TATATGACGT TGTAAAAGTT GGAGAAAAGG GTCTTATCGG AGAGATCATT
GAAATGAGAG GAGATAAGGC TTCAATCCAG GTATATGAAG AAACATCAGG TATTGGACCT
GGGGACCCAG TTATAACTAC TGGAGAACCA CTTTCAGTAG AATTAGGACC AGGACTTATA
GAGTCAATGT TCGATGGAAT ACAAAGACCT CTAGACGCTT TCATGAAAGC AGCTAATTCT
GCTTTCTTAA GTAAAGGGGT AGAAGTTAAA TCTTTAAATA GAGAGAAAAA ATGGCCTTTT
GTGCCAACTG CTAAGGTTGG AGATAAGGTT TCAGCTGGAG ACGTAATAGG AACAGTTCAA
GAGACTGCCG TTGTTCTTCA TAGAATAATG GTTCCTTTCG GCGTTGAAGG TACAATAAAA
GAAATCAAAG CTGGAGATTT CAATGTAGAA GAAGTTATCG CTGTAGTAGA AACTGAAAAA
GGTGATAAGA ATTTAACATT AATGCAAAAA TGGCCTGTAA GAAAAGGTAG ACCATATGCA
AGAAAATTAA ATCCAGTTGA GCCAATGACT ACAGGACAAA GAGTTATAGA TACTTTCTTC
CCAGTAGCTA AAGGTGGAGC TGCTGCCGTT CCAGGACCTT TCGGAGCTGG TAAAACAGTA
GTTCAGCACC AAGTTGCTAA ATGGGGAGAT ACTGAGATAG TTGTTTACGT TGGATGTGGA
GAACGTGGTA ACGAGATGAC AGACGTTCTT AACGAATTCC CAGAACTTAA AGACCCTAAA
ACTGGGGAAA GCTTAATGAA GAGAACAGTT CTTATTGCTA ATACATCTAA CATGCCAGTT
GCTGCCAGAG AAGCATCAAT ATATACTGGT ATAACAATAG CAGAGTATTT CAGAGATATG
GGATACTCAG TATCAATCAT GGCTGACTCA ACTTCACGTT GGGCAGAGGC TTTAAGAGAA
ATGTCAGGAA GACTTGAAGA AATGCCAGGA GACGAAGGTT ACCCAGCATA CTTAGGATCA
AGACTTGCTG ATTACTATGA AAGAGCTGGT AAAGTTGTAG CTTTAGGTAA AGATGGAAGA
GAAGGAGCTG TTACAGCTAT CGGAGCAGTA TCCCCTCCAG GAGGAGATAT ATCTGAGCCA
GTTACACAAT CAACTTTAAG AATAGTTAAA GTTTTCTGGG GACTAGATGC TCAATTAGCA
TATAAGAGAC ACTTCCCATC AATTAACTGG TTAACATCAT ACTCATTATA CTTAGAAAAA
ATGGGTGAAT GGATGGATGC TCACGTAGCA GACGATTGGT CAGCATTAAG AACAGAAGCT
ATGGCACTTC TTCAAGAAGA AGCAAACTTA GAAGAAATAG TAAGACTTGT TGGTATGGAT
GCACTTTCAG AAGGTGATAG ATTAAAACTT GAAGTTGCTA AGTCAATAAG AGAAGACTAT
TTACAACAAA ACGCATTCCA TGAGAATGAC ACATATACTT CATTAAATAA ACAGTACAAA
ATGTTAAACT TAATCTTAAG TTTCAAACAT GAGGCTGAAA AAGCTTTAGA AGCTGGAGTT
TATTTAGATA AAGTATTAAA ACTTCCTGTT AGAGATAGAA TTGCAAGAAG TAAATATATT
TCAGAAGAAG AAATAAGTAA GATGGATGAC ATCTTAGTTG AATTAAAATC AGAGATGAAC
AAGTTAATCA GCGAGGGAGG TGTTCTAAAT GCTTAA
 
Protein sequence
MKTGKIIKVS GPLVVAEGMD EANVYDVVKV GEKGLIGEII EMRGDKASIQ VYEETSGIGP 
GDPVITTGEP LSVELGPGLI ESMFDGIQRP LDAFMKAANS AFLSKGVEVK SLNREKKWPF
VPTAKVGDKV SAGDVIGTVQ ETAVVLHRIM VPFGVEGTIK EIKAGDFNVE EVIAVVETEK
GDKNLTLMQK WPVRKGRPYA RKLNPVEPMT TGQRVIDTFF PVAKGGAAAV PGPFGAGKTV
VQHQVAKWGD TEIVVYVGCG ERGNEMTDVL NEFPELKDPK TGESLMKRTV LIANTSNMPV
AAREASIYTG ITIAEYFRDM GYSVSIMADS TSRWAEALRE MSGRLEEMPG DEGYPAYLGS
RLADYYERAG KVVALGKDGR EGAVTAIGAV SPPGGDISEP VTQSTLRIVK VFWGLDAQLA
YKRHFPSINW LTSYSLYLEK MGEWMDAHVA DDWSALRTEA MALLQEEANL EEIVRLVGMD
ALSEGDRLKL EVAKSIREDY LQQNAFHEND TYTSLNKQYK MLNLILSFKH EAEKALEAGV
YLDKVLKLPV RDRIARSKYI SEEEISKMDD ILVELKSEMN KLISEGGVLN A