Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_1527 |
Symbol | |
ID | 7267304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 1863439 |
End bp | 1866549 |
Gene Length | 3111 bp |
Protein Length | 1036 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643566370 |
Product | Fe-S-cluster-containing hydrogenase components 1-like protein |
Protein accession | YP_002462866 |
Protein GI | 219848433 |
COG category | [C] Energy production and conversion |
COG ID | [COG0437] Fe-S-cluster-containing hydrogenase components 1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAGA TCAACCTTCA CCCTGCCGCG CCCGAAATCG AGGCTGTGCG CGCCCAATTA CAGCAGGCGC GAGGCAAACA GTTCTGGCGT TCGCTCGATC AGTTGGTCGA TACGCCGGCC TTTCGCGAAT TGATTGCCCG CGAGTTCCCG CAGGGCGCGA GCGAGCTGGC CGATCCGGTA TCGCGCCGCA CTTTTCTCAA GCTGATGGGA GCGTCGCTAG CGCTGGCCGG TCTCTCCGGC TGTACCGTTG CGTTCAAACA GCCGCAAGAG AAAGTGGCTC CCTTTGCCCG CGCACCGCGC GATCAGATTC CGGGCATTCC GAACTATTAC GCCACTGCTG TCATGCTTGA TGGGTTCGCC CTCGGGGTTA CGGTAAAGAG TAACGACGGG CGCCCAACGA AGATCGAGGG GAATCCGGCC CATCCGGCAA GCCTCGGTGC GACTGATGCC TTCGCACAAG CTGAGTTGTT GGCACTTTAC GATCCCGACC GACCCGAAAC AGTGCGTCGG TTCGGTTTGC TGAGTACGTG GGAGGCGTTT GTTACGGCAA TTAACGAACC CTTGCAGGTG CAACGGGCAT TGCAGGGTCA AGGGTTGCGG TTGTTGACGC CAACCATTAC CTCGCCCACA TTACGCGCCC AGATAGCCGA ACTATTGACC AACTTCCCGG CAGCACGCTG GGTTCAGTAC GATCCGGTAG GCCGCAGCAA CACCTACGCC GGTGCAGCAT TGGCGTTCGG GGCGCCGTAT GAGCCTCGTT ATAACTTCGC CGAGGCTGAG GTCGTCTTGG CACTCGATGT CGATTTTGTT AGCGAAGGTC CCGGTCGGGT TCGCTATGCG CGCGATCTGA TGCAACGCCG CCGGGTGCGC GCCGAAACGA CCACAATGAG TCGGTTGTAC GTCGCCGAGC CTGTCTTCTC ACCGACCGGT GCCGTTGCCG ATCACCGCCT GGCAATCCAG GCCGGTCTGG TCGGGCAATT GGCTGCTGCT ATTGCCAATG AGTTAGGAGT GACAGCAGTA GCGCCGGCGA CCGGCCTGAA CGAGGTTCAG CAGAAGTGGG TGGCGACCGT CGCCGCCGAT CTACGTCGTG CCGGCTCACG GGCAGTTGTC GTGGTGGGCG AGGCGCAGCC GCCGGTTGTG CATGCCATTG CCCATGCGAT CAATGTGCAG CTCGGTGCCG TGGGGACAAC GGTTGAATTG ACCGAGCCGG TTGCCCAACC GGCCGATCCG CAGGATTTGG TGACCTTAAC TGAGGAGTTA CGGGCCGGTA GTGTTGAGCT GTTGGTTATT ATTGATAGCA ACCCCGTCTT TACGGCACCG GCTGATCTGA ACTTCGCCAA GGCGATGAAG CAGGCGAAGC AGGTTGTGGT GCTGAACCCC TACGAAGACG AAACGGCGGT TCAGGCCTCG TGGTTTATCC CATTGACCCA TCCCCTCGAG TCGTGGTCGG ACGCGCGCGC GTATGATGGT ACCGTGAGCA TTATTCAGCC GCTCATTCGC CCTCTCTATA GCTCGCGTAC CGCTCACGAA TTGCTGGCTG TCCTGAACGG TGCCGTCGGA ACGACAGATT ACGATAGTGT CCGAACGTAT TGGCAGCAGC AAACCGGTCT TGATAATGCC GCTTTTGATG ATTTCTTCAA GCGTGCCCTA AGCACCGGCG TGATCGAGGG AACGCGCTTG GAGCCGGTAA ATGTGTCGTT GGTCGCAGGG TTACAGTTAC AGGCGCCACC GCCTACGACC GGTCTGGAAT TGTTGTTCCG CCCCGATCCG GCGATCTGGG ATGGGCGTTT TGCCAATAAC GGCTGGCTGC AAGAGCTGCC CCGCCCGATG ACTAAGCTGA CGTGGGATAA TGCCGCGCTG GTGAGTCCGC GCACAGCCAT CCGGTTGCTC AACCTGCCGT TTGACCCGGC CAGCTTGGCT GCGCCCGGTC GAGCACGCGA TCAGGCACTC GAACGCCTTA CCGGTGAAAA CGGTCGCATG ATCGACATTA CGACGCCGGT GGGGACGTTG CGCATGCCCA TCTGGATTGT CCCCGGTCAT GCCGATGATA CGATTACGGT GACGCTGGGC TATGGCCGTA CCCATGGCGG GCGAGTGGCC GAAGGCGCTG GGTTCAATGT CTATCGCCTG CGCCAGAGCG CCAATCCGTG GCTAGTGGCC GACGTGAGCG CAACCGCTGT GAACGAGCGT TATCTGCTGG TCAGCACCCA AGATCACTGG ACGTTGGAAG GACGTGATGT GGTGCGCGCC GGTGAGTTTG TTCGCTTCAA AGAAGATCCG AAGTATATCG CCAAAGAAGT CTATGCTGAG AAGTACGGGT CACCGGAACG GAAGCCGCAA TACCAATCAC TGCTCCCCGG TTTCGACTAT AGCACCGGTA ATCAGTGGGG GATGGTCATC GACCTCTCGG CGTGTATCGG TTGCAATGCG TGTGTAGTTG CCTGCCAGGC TGAAAACAAC ATTCCGATTG TCGGTAAAAA CGAAGTGGCT CGTGGGCGTG AGATGCACTG GATCCGGATT GACCGCTATT ACGCCGGTGA GGATCTCGAC AATCCAGAAG CATACTTAAT GCCGATGACC TGTGCTCACT GTGAGAAGGC ACCCTGCGAG CTGGTCTGCC CGGTGGCGGC TACCGTCCAC GATGCCGAGG GCATTAACAA TATGGTGTAC AACCGTTGTG TCGGTACGAA GTACTGCTCG AACAACTGTC CGTTTAAGGT TCGCCGGTTC AACTTCTTGC AGTACAGCGA CCTGACTACC GAGAGCCTCA AGCTCATGCG CAACCCGCAA GTGACGGTAC GCAACCGTGG TGTGATGGAG AAGTGCAGCT ACTGCGTACA GCGGATCAGT GCAGCTCGGA TCAAGGCGAA GGTTGAGGGC AACCGCTCCA TCCGTGATGG TGAAGTCGTA GCTGCTTGTC AGCAGGTTTG CCCGACCGAG GCTATCATAT TCGGCAACAT CAACGATCCC AATAGTCGGG TAGCACAGCT CAAGCAGCAG CCGCATAACT ACACCGTGTT CGACGAATTG AATCTCAAGC CGCGCACGAG CTATCTGGCG CGGGTACGTA ACCCCGAAGA ATCACTTGAT GGTGGTCACA GTGCAGGGTA G
|
Protein sequence | MSEINLHPAA PEIEAVRAQL QQARGKQFWR SLDQLVDTPA FRELIAREFP QGASELADPV SRRTFLKLMG ASLALAGLSG CTVAFKQPQE KVAPFARAPR DQIPGIPNYY ATAVMLDGFA LGVTVKSNDG RPTKIEGNPA HPASLGATDA FAQAELLALY DPDRPETVRR FGLLSTWEAF VTAINEPLQV QRALQGQGLR LLTPTITSPT LRAQIAELLT NFPAARWVQY DPVGRSNTYA GAALAFGAPY EPRYNFAEAE VVLALDVDFV SEGPGRVRYA RDLMQRRRVR AETTTMSRLY VAEPVFSPTG AVADHRLAIQ AGLVGQLAAA IANELGVTAV APATGLNEVQ QKWVATVAAD LRRAGSRAVV VVGEAQPPVV HAIAHAINVQ LGAVGTTVEL TEPVAQPADP QDLVTLTEEL RAGSVELLVI IDSNPVFTAP ADLNFAKAMK QAKQVVVLNP YEDETAVQAS WFIPLTHPLE SWSDARAYDG TVSIIQPLIR PLYSSRTAHE LLAVLNGAVG TTDYDSVRTY WQQQTGLDNA AFDDFFKRAL STGVIEGTRL EPVNVSLVAG LQLQAPPPTT GLELLFRPDP AIWDGRFANN GWLQELPRPM TKLTWDNAAL VSPRTAIRLL NLPFDPASLA APGRARDQAL ERLTGENGRM IDITTPVGTL RMPIWIVPGH ADDTITVTLG YGRTHGGRVA EGAGFNVYRL RQSANPWLVA DVSATAVNER YLLVSTQDHW TLEGRDVVRA GEFVRFKEDP KYIAKEVYAE KYGSPERKPQ YQSLLPGFDY STGNQWGMVI DLSACIGCNA CVVACQAENN IPIVGKNEVA RGREMHWIRI DRYYAGEDLD NPEAYLMPMT CAHCEKAPCE LVCPVAATVH DAEGINNMVY NRCVGTKYCS NNCPFKVRRF NFLQYSDLTT ESLKLMRNPQ VTVRNRGVME KCSYCVQRIS AARIKAKVEG NRSIRDGEVV AACQQVCPTE AIIFGNINDP NSRVAQLKQQ PHNYTVFDEL NLKPRTSYLA RVRNPEESLD GGHSAG
|
| |