Gene Cagg_0479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0479 
Symbol 
ID7266647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp588650 
End bp589765 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content58% 
IMG OID643565342 
Producthydrogenase expression/formation protein HypD 
Protein accessionYP_002461856 
Protein GI219847423 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0409] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00075] hydrogenase expression/formation protein HypD 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0595509 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.140537 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATATC TTGACGAATT TCGTGACCCA GCGCTGGCCC GCCGCCTCTT CGAGCAAATC 
CGACGCATCA CCACACGTCA CTGGGCAATT ATGGAAGTCT GCGGCGGCCA GACCCATTCG
ATCATTCGCA ACGGAATCGA TCAGTTACTG CCACCCGAGA TCGAGCTTAT CCACGGCCCC
GGCTGCCCAG TCTGCGTTAC TCCACTCGAA ATCATCGACA AGGCGCTGGC CATTGCCGCC
CTCCCCGAAG TGATCTTCTG CTCATTTGGC GACATGCTGC GCGTGCCAGG TAGCCGCAAA
GACCTCTTCC GCGTCAAGAG CGAAGGTGGT GACGTGCGCG TCGTCTATTC CCCGCTCGAC
GCAGTAAAGC TGGCCCAACA ACACCCCGAC CGCCAAGTCG TCTTTTTTGC CATCGGCTTC
GAGACTACTG CACCCGCCAA CGCCATGGCG GTCTATCAGG CAGCCAGACT CGGCCTCAAG
AACTTTTCGA TGTTGGTCTC ACACGTCCTG GTACCACCGG CAATCAGTGC GATTATGGAG
TCGCCGAACA ACCGCGTCCA AGGCTTTCTA GCCGCTGGTC ATGTCTGCAG CGTCATGGGC
ATCGAAGAAT ATCGCTCGCT CGTCGAAACA TATCGTGTCC CCATTGTGGT TACCGGTTTT
GAGCCACTCG ACGTACTCGA AGGCATTCGT CGCGCCATTC TCCAACTCGA GCAAGGCCGT
GCCGAACTAG ACAACGCCTA CGAACGCGCC GTTCGCCCGG AAGGCAACGT CGCCGCCAAA
CAAATGCTTG CCGATGTCTT CACCGTCACC GACCGCACTT GGCGTGGAAT TGGGCGCATC
CCGCGCAGCG GTTGGCGGCT CAGTGACCGC TACGCCGAAT TCGATGCCGA ATTCCGATTC
AACGTCCACG ACATCCAAAC GAGCGAGTCG CCGCTATGTC GGAGTGGTGA AGTGCTGCAA
GGATTGCTCA AACCAAACCA ATGCCCGGCC TTCGGTAAAG AATGCACACC GCGGACGCCA
CTTGGCGCAA CGATGGTATC AAGCGAGGGA GCATGCGCAG CGTATTATCA GTATGGCCGA
TTCGTGCCAA CCAGCACGAT TGGTGTAGCA TCGTAA
 
Protein sequence
MKYLDEFRDP ALARRLFEQI RRITTRHWAI MEVCGGQTHS IIRNGIDQLL PPEIELIHGP 
GCPVCVTPLE IIDKALAIAA LPEVIFCSFG DMLRVPGSRK DLFRVKSEGG DVRVVYSPLD
AVKLAQQHPD RQVVFFAIGF ETTAPANAMA VYQAARLGLK NFSMLVSHVL VPPAISAIME
SPNNRVQGFL AAGHVCSVMG IEEYRSLVET YRVPIVVTGF EPLDVLEGIR RAILQLEQGR
AELDNAYERA VRPEGNVAAK QMLADVFTVT DRTWRGIGRI PRSGWRLSDR YAEFDAEFRF
NVHDIQTSES PLCRSGEVLQ GLLKPNQCPA FGKECTPRTP LGATMVSSEG ACAAYYQYGR
FVPTSTIGVA S