Gene Cagg_1527 details


Gene Information

Locus tag: Cagg_1527
Symbol:
ID: 7267304
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1863439
End bp: 1866549
Gene Length: 3111 bp
Protein Length: 1036 aa
Translation table: 11
GC content: 59%
IMG OID: 643566370
Product: Fe-S-cluster-containing hydrogenase components 1-like protein
Protein accession: YP_002462866
Protein GI: 219848433
COG category: [C] Energy production and conversion
COG ID: [COG0437] Fe-S-cluster-containing hydrogenase components 1
TIGRFAM ID:
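
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1863439
end_bp = 1866549
stated_gene_length = 3111      # bp
stated_protein_length = 1036   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, gene_length == stated_gene_length)           # 3111 True
print(protein_length, protein_length == stated_protein_length)  # 1036 True
```

Under those assumptions the stated lengths agree with the coordinates.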


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGAGA TCAACCTTCA CCCTGCCGCG CCCGAAATCG AGGCTGTGCG CGCCCAATTA 
CAGCAGGCGC GAGGCAAACA GTTCTGGCGT TCGCTCGATC AGTTGGTCGA TACGCCGGCC
TTTCGCGAAT TGATTGCCCG CGAGTTCCCG CAGGGCGCGA GCGAGCTGGC CGATCCGGTA
TCGCGCCGCA CTTTTCTCAA GCTGATGGGA GCGTCGCTAG CGCTGGCCGG TCTCTCCGGC
TGTACCGTTG CGTTCAAACA GCCGCAAGAG AAAGTGGCTC CCTTTGCCCG CGCACCGCGC
GATCAGATTC CGGGCATTCC GAACTATTAC GCCACTGCTG TCATGCTTGA TGGGTTCGCC
CTCGGGGTTA CGGTAAAGAG TAACGACGGG CGCCCAACGA AGATCGAGGG GAATCCGGCC
CATCCGGCAA GCCTCGGTGC GACTGATGCC TTCGCACAAG CTGAGTTGTT GGCACTTTAC
GATCCCGACC GACCCGAAAC AGTGCGTCGG TTCGGTTTGC TGAGTACGTG GGAGGCGTTT
GTTACGGCAA TTAACGAACC CTTGCAGGTG CAACGGGCAT TGCAGGGTCA AGGGTTGCGG
TTGTTGACGC CAACCATTAC CTCGCCCACA TTACGCGCCC AGATAGCCGA ACTATTGACC
AACTTCCCGG CAGCACGCTG GGTTCAGTAC GATCCGGTAG GCCGCAGCAA CACCTACGCC
GGTGCAGCAT TGGCGTTCGG GGCGCCGTAT GAGCCTCGTT ATAACTTCGC CGAGGCTGAG
GTCGTCTTGG CACTCGATGT CGATTTTGTT AGCGAAGGTC CCGGTCGGGT TCGCTATGCG
CGCGATCTGA TGCAACGCCG CCGGGTGCGC GCCGAAACGA CCACAATGAG TCGGTTGTAC
GTCGCCGAGC CTGTCTTCTC ACCGACCGGT GCCGTTGCCG ATCACCGCCT GGCAATCCAG
GCCGGTCTGG TCGGGCAATT GGCTGCTGCT ATTGCCAATG AGTTAGGAGT GACAGCAGTA
GCGCCGGCGA CCGGCCTGAA CGAGGTTCAG CAGAAGTGGG TGGCGACCGT CGCCGCCGAT
CTACGTCGTG CCGGCTCACG GGCAGTTGTC GTGGTGGGCG AGGCGCAGCC GCCGGTTGTG
CATGCCATTG CCCATGCGAT CAATGTGCAG CTCGGTGCCG TGGGGACAAC GGTTGAATTG
ACCGAGCCGG TTGCCCAACC GGCCGATCCG CAGGATTTGG TGACCTTAAC TGAGGAGTTA
CGGGCCGGTA GTGTTGAGCT GTTGGTTATT ATTGATAGCA ACCCCGTCTT TACGGCACCG
GCTGATCTGA ACTTCGCCAA GGCGATGAAG CAGGCGAAGC AGGTTGTGGT GCTGAACCCC
TACGAAGACG AAACGGCGGT TCAGGCCTCG TGGTTTATCC CATTGACCCA TCCCCTCGAG
TCGTGGTCGG ACGCGCGCGC GTATGATGGT ACCGTGAGCA TTATTCAGCC GCTCATTCGC
CCTCTCTATA GCTCGCGTAC CGCTCACGAA TTGCTGGCTG TCCTGAACGG TGCCGTCGGA
ACGACAGATT ACGATAGTGT CCGAACGTAT TGGCAGCAGC AAACCGGTCT TGATAATGCC
GCTTTTGATG ATTTCTTCAA GCGTGCCCTA AGCACCGGCG TGATCGAGGG AACGCGCTTG
GAGCCGGTAA ATGTGTCGTT GGTCGCAGGG TTACAGTTAC AGGCGCCACC GCCTACGACC
GGTCTGGAAT TGTTGTTCCG CCCCGATCCG GCGATCTGGG ATGGGCGTTT TGCCAATAAC
GGCTGGCTGC AAGAGCTGCC CCGCCCGATG ACTAAGCTGA CGTGGGATAA TGCCGCGCTG
GTGAGTCCGC GCACAGCCAT CCGGTTGCTC AACCTGCCGT TTGACCCGGC CAGCTTGGCT
GCGCCCGGTC GAGCACGCGA TCAGGCACTC GAACGCCTTA CCGGTGAAAA CGGTCGCATG
ATCGACATTA CGACGCCGGT GGGGACGTTG CGCATGCCCA TCTGGATTGT CCCCGGTCAT
GCCGATGATA CGATTACGGT GACGCTGGGC TATGGCCGTA CCCATGGCGG GCGAGTGGCC
GAAGGCGCTG GGTTCAATGT CTATCGCCTG CGCCAGAGCG CCAATCCGTG GCTAGTGGCC
GACGTGAGCG CAACCGCTGT GAACGAGCGT TATCTGCTGG TCAGCACCCA AGATCACTGG
ACGTTGGAAG GACGTGATGT GGTGCGCGCC GGTGAGTTTG TTCGCTTCAA AGAAGATCCG
AAGTATATCG CCAAAGAAGT CTATGCTGAG AAGTACGGGT CACCGGAACG GAAGCCGCAA
TACCAATCAC TGCTCCCCGG TTTCGACTAT AGCACCGGTA ATCAGTGGGG GATGGTCATC
GACCTCTCGG CGTGTATCGG TTGCAATGCG TGTGTAGTTG CCTGCCAGGC TGAAAACAAC
ATTCCGATTG TCGGTAAAAA CGAAGTGGCT CGTGGGCGTG AGATGCACTG GATCCGGATT
GACCGCTATT ACGCCGGTGA GGATCTCGAC AATCCAGAAG CATACTTAAT GCCGATGACC
TGTGCTCACT GTGAGAAGGC ACCCTGCGAG CTGGTCTGCC CGGTGGCGGC TACCGTCCAC
GATGCCGAGG GCATTAACAA TATGGTGTAC AACCGTTGTG TCGGTACGAA GTACTGCTCG
AACAACTGTC CGTTTAAGGT TCGCCGGTTC AACTTCTTGC AGTACAGCGA CCTGACTACC
GAGAGCCTCA AGCTCATGCG CAACCCGCAA GTGACGGTAC GCAACCGTGG TGTGATGGAG
AAGTGCAGCT ACTGCGTACA GCGGATCAGT GCAGCTCGGA TCAAGGCGAA GGTTGAGGGC
AACCGCTCCA TCCGTGATGG TGAAGTCGTA GCTGCTTGTC AGCAGGTTTG CCCGACCGAG
GCTATCATAT TCGGCAACAT CAACGATCCC AATAGTCGGG TAGCACAGCT CAAGCAGCAG
CCGCATAACT ACACCGTGTT CGACGAATTG AATCTCAAGC CGCGCACGAG CTATCTGGCG
CGGGTACGTA ACCCCGAAGA ATCACTTGAT GGTGGTCACA GTGCAGGGTA G
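
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be joined before its length or GC content can be compared with the Gene Length and GC content fields. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the demo input is only the first line of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here only as a short demo input.
first_line = "ATGAGTGAGA TCAACCTTCA CCCTGCCGCG CCCGAAATCG AGGCTGTGCG CGCCCAATTA"
seq = clean_sequence(first_line)

print(len(seq))                    # 60
print(round(gc_percent(seq), 1))   # GC percentage of this 60-base excerpt
```

Passing the full listing through `clean_sequence` should give a string whose length can be checked against the stated 3111 bp, and `gc_percent` can then be compared with the stated 59%.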
 
Protein sequence
MSEINLHPAA PEIEAVRAQL QQARGKQFWR SLDQLVDTPA FRELIAREFP QGASELADPV 
SRRTFLKLMG ASLALAGLSG CTVAFKQPQE KVAPFARAPR DQIPGIPNYY ATAVMLDGFA
LGVTVKSNDG RPTKIEGNPA HPASLGATDA FAQAELLALY DPDRPETVRR FGLLSTWEAF
VTAINEPLQV QRALQGQGLR LLTPTITSPT LRAQIAELLT NFPAARWVQY DPVGRSNTYA
GAALAFGAPY EPRYNFAEAE VVLALDVDFV SEGPGRVRYA RDLMQRRRVR AETTTMSRLY
VAEPVFSPTG AVADHRLAIQ AGLVGQLAAA IANELGVTAV APATGLNEVQ QKWVATVAAD
LRRAGSRAVV VVGEAQPPVV HAIAHAINVQ LGAVGTTVEL TEPVAQPADP QDLVTLTEEL
RAGSVELLVI IDSNPVFTAP ADLNFAKAMK QAKQVVVLNP YEDETAVQAS WFIPLTHPLE
SWSDARAYDG TVSIIQPLIR PLYSSRTAHE LLAVLNGAVG TTDYDSVRTY WQQQTGLDNA
AFDDFFKRAL STGVIEGTRL EPVNVSLVAG LQLQAPPPTT GLELLFRPDP AIWDGRFANN
GWLQELPRPM TKLTWDNAAL VSPRTAIRLL NLPFDPASLA APGRARDQAL ERLTGENGRM
IDITTPVGTL RMPIWIVPGH ADDTITVTLG YGRTHGGRVA EGAGFNVYRL RQSANPWLVA
DVSATAVNER YLLVSTQDHW TLEGRDVVRA GEFVRFKEDP KYIAKEVYAE KYGSPERKPQ
YQSLLPGFDY STGNQWGMVI DLSACIGCNA CVVACQAENN IPIVGKNEVA RGREMHWIRI
DRYYAGEDLD NPEAYLMPMT CAHCEKAPCE LVCPVAATVH DAEGINNMVY NRCVGTKYCS
NNCPFKVRRF NFLQYSDLTT ESLKLMRNPQ VTVRNRGVME KCSYCVQRIS AARIKAKVEG
NRSIRDGEVV AACQQVCPTE AIIFGNINDP NSRVAQLKQQ PHNYTVFDEL NLKPRTSYLA
RVRNPEESLD GGHSAG
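
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which start codons are allowed), and the function name `translate` is illustrative only.

```python
from itertools import product

# Codons in TCAG order, with the amino acid for each in the same order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence listed in this record.
print(translate("ATGAGTGAGA TCAACCTTCA CCCTGCCGCG"))  # MSEINLHPAA
print(translate("GGTGGTCACA GTGCAGGGTA G"))           # GGHSAG
```

Both excerpts match the start and end of the protein sequence listed above.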