Gene Caci_0849 details


Gene Information

Locus tag: Caci_0849
Symbol:
ID: 8332179
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 983641
End bp: 985236
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 70%
IMG OID: 644953999
Product: 4Fe-4S ferredoxin iron-sulfur binding domain protein
Protein accession: YP_003111623
Protein GI: 256390059
COG category: [C] Energy production and conversion
COG ID: [COG1143] Formate hydrogenlyase subunit 6/NADH:ubiquinone oxidoreductase 23 kD subunit (chain I)
TIGRFAM ID:
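
The length fields above agree with each other if the start and end positions are read as inclusive, 1-based coordinates: 985236 - 983641 + 1 = 1596 bp, and 1596 / 3 = 532 codons, one of which is the stop codon, leaving 531 aa. A minimal sketch of that check follows; the inclusive-coordinate convention is an assumption, and the function names are illustrative rather than part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(983641, 985236)
print(length, protein_length_aa(length))  # 1596 531
```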


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000747261
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000428975
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCCGCGG ACAAGACAGG CGCAGAACAG CGCGGCGCGG ACAAGACCGC CGACAAGGCC 
GGCGCCCCCA AAACCAGCGC CCCCACAACC AGCGCCGACA AACGCGGCCT CCCCGGCGCC
GGCCTGGCCA AGGGCCTGGC CACGACCCTG AAGACGATGA CCCACCGCTC GGTCACCGCG
CAGTACCCGG ACGTGAAGCC GGAGCTCCCC CCGCGCTCCC GAGGCGTCAT CGCCCTGCTC
GAGGAGAACT GCACCTCCTG CATGCTCTGC GCCCGCGAAT GTCCCGACTG GTGCATCTAC
ATCGACTCCC ACAAGGAAGT CGTCCCAGCG ACAGAGCCAG GCCAACGCGA CCGCACCCGC
AACGTCCTCG ACCGCTTCGC GATCGACTAC AGCCTGTGCA TGTACTGCGG CATCTGCATC
GAAGCCTGCC CCTTCGACGC CCTCTTCTGG TCCCCCGAAT TCGAATACGC CGAATTCGAC
ATCGTCGACC TCCTCCACGA AAAGGACCGC CTCCGCGACT GGATGTGGAC CGTCCCCCCA
CCCCCACCCC ACGCCCCGAA CGCAGAGCCC CCCAAGGAAA TCGCCGCAGC CCAGAAAGCC
GTCGAAAAGC AAGCCGCCGC CGAAGCCAAG GCCGCCGCAG AACGCGCAGC CGCAGCCGCG
ACCCCACCCC CACCCGTCGA AGGCGGCACA CCCAAGCAGC CCGAAGCCGA CACGACGGAG
ATCCCCAAGA TCACCGCCAC CGACCCGCCG CCCCGCGACG CACTGAACGA CACGGCCGAA
ATCCCCGTCG TCGAGACGCA GGCTCCGACG CAGCCGCTGC CTCCCGTGCC CCCCACGACC
GAGTCGAAGT CGACGCAGGC TCCAACGCAA CCGCTGCCCC ACGTGACCCC CTCGGCCGAG
TCGAAGCTGA CGCAGGCTCC GACGCAGCCG CTCCCTCACG TGACCCCTGC GGCCGAGTCG
AACCTTTCGC CAGGGCTGAT CGCTGAGATC GCGGCGGCCA AGGCTGCTGC CGAAGTCGCG
CCGGAAGCTG TCACCGAGGC GCCGACCGAG CCCGAGCCGG AAGCTGTCGT CGAGCCTGCT
GCCGAAGCAC CGACTCCGGT CGAGCCGGAA GCTGCTGCCG AGCCAGAGGT CCAGTCAGCC
GCGACCGCCG AGCCCGCTGC CGAAGCCGCA ACCGAGGCTC CGCTCGAGCC CGAAGCCACA
GCCGCGCCCG TCGCGACCGA TAAGCCTGCT GCCGAACCCG CAGCGGAACC TGCGACCGCC
GAGCTCGCCG CCGAGCCAGA GGTCCAGGCC GCCCCCACCG AGCCCACCAC TCCCGAGTCC
ACCACCGACT CAGCATCCGG CGACTCCGAC TCCGACTCTG ACCCCGCCCC CAAGCCGACA
CCCCGCCGCC CCCGCAAAAC CGCAGCCGCC AAGACCACCA CCTCCAAAAC CGCAGCCTCC
AAAACCGCCG CCGCGAAAAA GGCCGCAGCC GCCAAAACCA CCACCCCCAA GCCCAAGCGC
CCCCGCAAAA CCGCCGCCCC CACCGACCAG CCGCCCGCGC CCCCGCAGCC CGAGTCCCCC
GAGAACGGCT CCGACTCCCC GGAGGCCGCC GGATGA
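
The gene sequence is printed in blocks of 10 bases, 60 per row, so the spaces and line breaks have to be removed before the sequence is measured. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field; the helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove block spacing and line breaks, and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied with its block spacing intact.
first_row = "ATGGCCGCGG ACAAGACAGG CGCAGAACAG CGCGGCGCGG ACAAGACCGC CGACAAGGCC"
print(len(clean_sequence(first_row)), round(gc_percent(first_row), 1))
```

Passing the full 1596 bp sequence to `gc_percent` would give the value to compare with the reported 70%.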
 
Protein sequence
MAADKTGAEQ RGADKTADKA GAPKTSAPTT SADKRGLPGA GLAKGLATTL KTMTHRSVTA 
QYPDVKPELP PRSRGVIALL EENCTSCMLC ARECPDWCIY IDSHKEVVPA TEPGQRDRTR
NVLDRFAIDY SLCMYCGICI EACPFDALFW SPEFEYAEFD IVDLLHEKDR LRDWMWTVPP
PPPHAPNAEP PKEIAAAQKA VEKQAAAEAK AAAERAAAAA TPPPPVEGGT PKQPEADTTE
IPKITATDPP PRDALNDTAE IPVVETQAPT QPLPPVPPTT ESKSTQAPTQ PLPHVTPSAE
SKLTQAPTQP LPHVTPAAES NLSPGLIAEI AAAKAAAEVA PEAVTEAPTE PEPEAVVEPA
AEAPTPVEPE AAAEPEVQSA ATAEPAAEAA TEAPLEPEAT AAPVATDKPA AEPAAEPATA
ELAAEPEVQA APTEPTTPES TTDSASGDSD SDSDPAPKPT PRRPRKTAAA KTTTSKTAAS
KTAAAKKAAA AKTTTPKPKR PRKTAAPTDQ PPAPPQPESP ENGSDSPEAA G
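
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method: it builds a codon table from the standard amino-acid assignments, which translation table 11 shares (table 11 differs in the start codons it permits), and translates the first 30 bases of the gene. All names in the code are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence.
print(translate("ATGGCCGCGG ACAAGACAGG CGCAGAACAG"))  # MAADKTGAEQ
```

The output matches the first block of the protein sequence, and the final six bases of the gene (GGATGA) translate to G followed by a stop codon, consistent with the protein's last residue.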