Gene Cphamn1_1592 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1592 
Symbol 
ID6375270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1716422 
End bp1718140 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content50% 
IMG OID642684080 
Productnickel-dependent hydrogenase large subunit 
Protein accessionYP_001959994 
Protein GI189500524 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.663154 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAAA AAATAGTTGT AGATCCGATT CCCAGAATTG AGGGACATTT ACGGATAGAA 
GCAAAACTCG GCGAAAACAA CGTTATTGAA GATGCATTCA GTAGCGGCAC CATGTGGCGC
GGTATTGAAA TTATTCTCAA GGGAAGGGAT CCACGTGATG CCTGGGCATT TGCAGAGCGT
ATCTGTGGTG TCTGCACCAC TGTTCATGCG CTTGCATCGG TCAGGTGTGT GGAGGACGCT
CTGGGTATAG AGATACCCGA GAACGCAAGG ATCATCAGAA ACCTTATGAA CGCTACCCAG
CAGACTCAGG ATCATCTGGT GCATTTCTAT CATCTCCATG CTCTGGACTG GGTGGATGTC
GTGAGCGCTC TCAAGGCTGA CCCGAAAGCA GCTTCACGAA TTGCACAGAG CCTCTCCTCC
TGGCCCAAAT CATCACCGGG TTATTTCAAG GACCTGCAAC AGAAACTTGT GGGTTTTGTG
GAAAGCGGAC AGCTTGGCAT TTTTGCCAAC GGGTACTGGG GACATCCCGC CTACAAACTG
CCGCCTGAAG TCAATCTCAT CGCGGTAGCT CATTATCTCG AGGCTCTTGA TTTTCAGAAA
GAAATCGTCA AGATTCATAC TGTGTTCGGA GGAAAGAACC CTCATCCTAA TTATCTGGTC
GGCGGTATGG CCTGTGCGCT CGATCCGGAT AGGGACACAG CGATCAATAT CGAGACCCTC
AGTATGGTGA AGAAAATCAT CGAGGATACC CAGACCTTTA TAGAGCAGGT GTATATTCCG
GATCTGATCG CTATTGCCGG TTTTTACAAG GATTGGGGAT ACGGGGGCGG ACTTGGCAAC
TACCTCTGTT ACGGCGATTT TCCTGAAAAA AGTATCGATG ACGCAGCTTC GCTGCTCTGG
CCGAGAGGAG CTATACTCAA CAAGGATCTT TCGACAATAC ATGATGTCGA TCCTCGTGAT
CTCAGTCAGG TTACCGAAGA GGTCAGCCAC AGCTGGTATA CCTACAGCAA TGGTGATGAA
AAAGGCCTGC ACCCTTGGGA GGGTGAGACA AAACCGGAAT ACACCGGACC GAAACCACCG
TTCGAATATC TGGACACGGA GAAGAAATAC AGCTGGCTGA AAACACCTCG CTGGAAGAAT
AATCCTATGG AGGTGGGTCC GCTTGCAAGG GTTCTGCTTG CTTACGCAAA GGGCGACCCG
ATGATCACGG ATACCGTCAA TATGGTACTT GGCAAGCTTG AGGTAGGACC AGAGGCGCTA
TTTTCAACAT TGGGAAGAAC AGCCGCCCGG GGTATAGAGT GCCTGCAGAC CGCAGGTTTT
ATGATGCATT ACTATGATCA GCTTATTGAC AATATCAAGA CAGGCGATTT ACGTACGGCA
AACACAGAAC TGTGGGATCC TTCCCGATGG CCGAAAGAAG CCAAGGGGTT TGGGTATACT
GAGGCTCCGC GCGGCGCGCT TGGTCACTGG ATTCATATCA GGGACGGCAA GATAGAGGAC
TACCAGATTG TTGTTCCTTC AACATGGAAC GCTTCGCCCC GTGATGCAAA TGGTGCGGTC
GGTGCCTACG AATCGGCGCT CAAAGGCACT CCCATGGTGG ATCCTGAACA GCCGCTCGAA
ATACTCAGAA CCATTCATTC TTTTGATCCT TGTCTTGCCT GCGCTTCTCA TGTTTTTGAC
ATGAACGGCA ACGAGATAAC AAAGGTCACC ATCGTTTAA
 
Protein sequence
MAKKIVVDPI PRIEGHLRIE AKLGENNVIE DAFSSGTMWR GIEIILKGRD PRDAWAFAER 
ICGVCTTVHA LASVRCVEDA LGIEIPENAR IIRNLMNATQ QTQDHLVHFY HLHALDWVDV
VSALKADPKA ASRIAQSLSS WPKSSPGYFK DLQQKLVGFV ESGQLGIFAN GYWGHPAYKL
PPEVNLIAVA HYLEALDFQK EIVKIHTVFG GKNPHPNYLV GGMACALDPD RDTAINIETL
SMVKKIIEDT QTFIEQVYIP DLIAIAGFYK DWGYGGGLGN YLCYGDFPEK SIDDAASLLW
PRGAILNKDL STIHDVDPRD LSQVTEEVSH SWYTYSNGDE KGLHPWEGET KPEYTGPKPP
FEYLDTEKKY SWLKTPRWKN NPMEVGPLAR VLLAYAKGDP MITDTVNMVL GKLEVGPEAL
FSTLGRTAAR GIECLQTAGF MMHYYDQLID NIKTGDLRTA NTELWDPSRW PKEAKGFGYT
EAPRGALGHW IHIRDGKIED YQIVVPSTWN ASPRDANGAV GAYESALKGT PMVDPEQPLE
ILRTIHSFDP CLACASHVFD MNGNEITKVT IV