Gene Cagg_1627 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1627 
Symbol 
ID7268928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1985992 
End bp1987272 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content60% 
IMG OID643566468 
Product4Fe-4S ferredoxin iron-sulfur binding domain protein 
Protein accessionYP_002462964 
Protein GI219848531 
COG category[C] Energy production and conversion 
COG ID[COG1143] Formate hydrogenlyase subunit 6/NADH:ubiquinone oxidoreductase 23 kD subunit (chain I) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.839914 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.132802 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTAC CTCCATTTGA GATAGAGATA CCGCACGGGC GTGTGATAGA GGTGCAGCGT 
GATGATACGC GGGTGCGGCG CGGTGGTGGC TTGCTGCGTG GGATGGCCGT GATTTGGAAG
CACTGGAAAG AGTCTTTTAA GCTCAACCGG AGCTATAGCC AGATCCACGG TACGTTTACT
ATTCAGTACC CAGAGGAACG GGCACGTATT CCCGAAACCT ATCGCAACAT GCCGATCCTC
CTTTACGACG ATGAGACCGG CCACGAACTC TGCACCTCAT GTTTTCAATG TGAGCGAATT
TGTCCGCCAC AGGTAATCCA TATGACGCAG GCGAAAGATC CGGCTACCGG TAAACCGGTG
CCGGCAGTGG CGGAGTTCAT TATCGAGTAC GATGCCTGTA TGAGTTGTGG TTTCTGCGCC
GAAGTCTGTC CCTTCGACGC GATCAAGATG GATCACGAGT TTGAACTTTC AACTGACGAT
CACGCTCGGT TAACGGTGCA TAAAGAGCAG CTCAATCGTC CGATCAGCTA TTATGCCAAG
ATTGCACCTA CAATGTGGGC TGAGGTGCGC GAAAGTGCGT TGAAGAAATT GCAAGGCAAT
ATCCGTCGTC GCCCTGATGT GATCGGTATT GCACCTCACC TGACCGAACG GATTGCGGCG
CGACGGGCTG AGTTGGCGGC ACAAGCCGCC CAAACTACAC CGGTAGCACC GTCAGATCAG
GCAACACCAC CGACGACAGA AGGTCGCAAG TTGACACCGG AAGAGAAGGC AGCCAAGCTA
GCCGCTATCC GTGCCGCCAA GGGAGCGCAG GCCGGTGATG GCGCATCGCC GGCTACCGAA
TCTGCACCAC CGACCGACAA AGCGGCGAAA TTAGCTGCGA TCCGAGCGGC CAATGCGGCG
AAGCGGGCGG CAGCGGGCGA GTCGACGACA CCTGCGGCGA CGGAGCCTCC GGCGGCGCCA
CCGACCGACA AAGCAGCGAA ATTAGCTGCG ATCCGGGCGG CCAATGCGGC GAAGCGGGCG
GCAGCGGGCG AGTCGACGAC ACCTGCGGCG ACGGAGCCTC CGGCGGCGCC ACCGACCGAC
AAAGCAGCGC GGCTGGCTGC GATCCGGGCG GCCAATGCGG CGAAGCGGGC GGCAGCGGGC
GAGTCGACAA CACCTGCGGC GACGGAGCCA CCGGCGGAGC CTTCGGCGGC GCCACCGACC
GACAAAGCAG CGCGGCTGGC TGCGATCCGG GCGGCCAATG CGGCGAAGAA ACAACAATCG
GGTGATGAAT CATCATCGTA G
 
Protein sequence
MTVPPFEIEI PHGRVIEVQR DDTRVRRGGG LLRGMAVIWK HWKESFKLNR SYSQIHGTFT 
IQYPEERARI PETYRNMPIL LYDDETGHEL CTSCFQCERI CPPQVIHMTQ AKDPATGKPV
PAVAEFIIEY DACMSCGFCA EVCPFDAIKM DHEFELSTDD HARLTVHKEQ LNRPISYYAK
IAPTMWAEVR ESALKKLQGN IRRRPDVIGI APHLTERIAA RRAELAAQAA QTTPVAPSDQ
ATPPTTEGRK LTPEEKAAKL AAIRAAKGAQ AGDGASPATE SAPPTDKAAK LAAIRAANAA
KRAAAGESTT PAATEPPAAP PTDKAAKLAA IRAANAAKRA AAGESTTPAA TEPPAAPPTD
KAARLAAIRA ANAAKRAAAG ESTTPAATEP PAEPSAAPPT DKAARLAAIR AANAAKKQQS
GDESSS