Gene CPF_1895 details


Gene Information

Locus tag: CPF_1895
Symbol:
ID: 4201195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2130044
End bp: 2131993
Gene Length: 1950 bp
Protein Length: 649 aa
Translation table: 11
GC content: 32%
IMG OID: 638082764
Product: V-type ATP synthase subunit I
Protein accession: YP_696328
Protein GI: 110799711
COG category: [C] Energy production and conversion
COG ID: [COG1269] Archaeal/vacuolar-type H+-ATPase subunit I
TIGRFAM ID:
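
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a coding sequence that ends in a single stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(2130044, 2131993)
print(length_bp)                  # 1950
print(protein_length(length_bp))  # 649
```

Both results agree with the Gene Length and Protein Length fields of this record.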


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAATAG TTAAGATGAA TAAATTCACC TTACTTGCCT TTGAATCAGA GAAAGAAAAG 
CTTCTTAAAA AAATTCAAAG TTCTTCAGAA GTTGAGTTCA TAAACTTACA AGATGAAGAA
AAAGTGGAAG GCAATGAAGT CTTTGAATCT CTTAGCAAAG ATGATTTAGA TGCTGAAACA
ACTCTTATAG AAGAAAATGT TTCAAAAACA AGATCAGCTT TAGACTTCTT AAAAGAGTTT
GTTGCTGAAG TTGGAGGACT TAAGGCTTTA AAAGCTGGTA AAGAAAGCTT AACTTTGGAT
GAGCTAGAAG TAAAGGTTGA AAATAGCAAC TGGGAAGTCG TTTATTCAGA GGTTAAGAAA
ATGGAGAGAG AGCTTGCTAC TCTAGAAAAC GAAAAGACAA AACTTTTAGG TGAAATAGAA
ATCTTAGAGC CATGGCAAAG TTTCGATGCT CCACTAGGTG AATTAAATAA TTTTGAAAAA
GTAGTTGCGT TCTTAGGTGT GATTTCAAGT CAAAACCTTG AAAACATGAA GAACGAAATT
GAAAGCGAAT TCAAGGAAAG TTATATAGAA GTTATTTCTA ACACTGACCA AGATTCATAT
GTCTTTGTAA TGACAATGAA AGAAAGAGCT GAAGAAATGG ACGAAGTTTT AAAGAACTTT
GGTTTTTCAG CGTTCCAAAC TAAGTACAAA GAAAAAGCTT CAGTTCTTGT TGAAGAGTTT
AAATTAAAGA TACAAGAAAT CGAAGCTCAA AAATCAGACT TAAAAGGAGT ACTTTCAAAT
TACAGAGAAG AAAAGAGAAC TTTAGAACTA GCTTATGAAT ATTACAGCAA CATACTTCTT
AGAAAAGAAG CAAGTGAAAA CTTCTTAAAA ACAGATAAAG TAGTTGTTAT TCAAGGTTGG
GTTCCTAAAA ATGATAATTC TTCTTTAGAA GGAATCATTC AAAGTTCAGT AGGAGATATG
TATTATCTTG AGTTTGAAGA AGTTAAGGAA GAAGAGGTTG CAGAAGTTCC AGTTAAATTA
CACAATAAGG GACCAGCAGC AGCTTTCGAT TCAATAACAG AAATGTATAG CTTACCAAGA
TATGATGAAA TAGACCCTAC ACCACTTTTA ACACCTTTCT ACCTAGTATT CTTCGGCATG
ATGGTTGCAG ATTTAGGATA CGGACTAGTA TTATTTGTAG GATCACTATT AGCTATGAAA
TTACTTAATT TAGATGAGGC ACAAGAAAAG TTTGCTAAAT TCTTTATGTA TTTAAGTATA
GCTACAACTA TTGCAGGTGC AGTATTTGGT ACTGCCTTTG GATTTGAATT GAAATCTATA
GGACTTATAA ATCCAAGTAA AGATACCAAC TTGTTACTAA TTTTATCAGT TGGATTTGGG
GTTATTCAAA TATTCTTTGG ATTATTTATA AAAGCGTACA TGTTAATCAG AGATAAGCAA
TATTTATATG CTTTATTTGA TGTAGGATCA TGGATTATGC TTTTAATAGG TCTTCCAATG
ATATTCTTTG ATGGTCCTAT AAGCTTAGTA GGTAAAGTAT TATCAATAGT AGGTTCAATA
TTAATAATCT TAACACAAGG TAGAGATGAA GAAACAAAAG GTGCTCAAAT AGGTCAAGGT
TTATATGCCT TATACGGTAT AACAGGATAC GTAGGAGACT TAGTATCATA TACAAGACTT
ATGGCTTTAG GTATTGCAGG TGGATCAATA GCAGCTGCAC TTAACTTAAT AATAGGTATG
TTCCCAGGAA TAGCAGTTAT AATTGTAGGA CCTCTATTCT TTATAGCAGC TCATACATTT
AACATGCTTT TATCATTACT TGGAGCATAT GTTCATACAG CGAGATTACA GTACGTTGAG
TACTTCTCAA AATTCTATGA AGGTGGCGGA AAAGCATTCA CACCATTTAG AACAATAAAT
AAATTCATAA CAATAAAAAG AAATAAATAA
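
The GC content field can in principle be recomputed from the gene sequence once the spaces and line breaks used for display are removed. The following is a minimal sketch with a hypothetical function name; it is applied to the first row of the sequence only, so its result is not expected to equal the whole-gene value reported in this record.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 60 bases of the gene sequence, copied from the first row above.
first_row = "ATGGCAATAG TTAAGATGAA TAAATTCACC TTACTTGCCT TTGAATCAGA GAAAGAAAAG"
print(f"{gc_content(first_row):.1f}%")
```

Passing the full 1950-base sequence to the same function would give the figure to compare against the GC content field.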
 
Protein sequence
MAIVKMNKFT LLAFESEKEK LLKKIQSSSE VEFINLQDEE KVEGNEVFES LSKDDLDAET 
TLIEENVSKT RSALDFLKEF VAEVGGLKAL KAGKESLTLD ELEVKVENSN WEVVYSEVKK
MERELATLEN EKTKLLGEIE ILEPWQSFDA PLGELNNFEK VVAFLGVISS QNLENMKNEI
ESEFKESYIE VISNTDQDSY VFVMTMKERA EEMDEVLKNF GFSAFQTKYK EKASVLVEEF
KLKIQEIEAQ KSDLKGVLSN YREEKRTLEL AYEYYSNILL RKEASENFLK TDKVVVIQGW
VPKNDNSSLE GIIQSSVGDM YYLEFEEVKE EEVAEVPVKL HNKGPAAAFD SITEMYSLPR
YDEIDPTPLL TPFYLVFFGM MVADLGYGLV LFVGSLLAMK LLNLDEAQEK FAKFFMYLSI
ATTIAGAVFG TAFGFELKSI GLINPSKDTN LLLILSVGFG VIQIFFGLFI KAYMLIRDKQ
YLYALFDVGS WIMLLIGLPM IFFDGPISLV GKVLSIVGSI LIILTQGRDE ETKGAQIGQG
LYALYGITGY VGDLVSYTRL MALGIAGGSI AAALNLIIGM FPGIAVIIVG PLFFIAAHTF
NMLLSLLGAY VHTARLQYVE YFSKFYEGGG KAFTPFRTIN KFITIKRNK
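
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The sketch below builds the codon table from the standard amino acid string in TCAG order; table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as start codons, which does not matter here because the gene starts with ATG. The function name is hypothetical, and only the first 30 bases are translated.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spacing
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGGCAATAG TTAAGATGAA TAAATTCACC"))  # MAIVKMNKFT
```

The output matches the first ten residues of the protein sequence above. Applying the same function to the last row of the gene sequence gives KFITIKRNK followed by the TAA stop codon, which matches the end of the protein.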