Gene Ccur_00990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcur_00990 
Symbol 
ID8374307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCryptobacterium curtum DSM 15641 
KingdomBacteria 
Replicon accessionNC_013170 
Strand
Start bp123620 
End bp125350 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content51% 
IMG OID644993023 
ProductNi,Fe-hydrogenase III large subunit 
Protein accessionYP_003150514 
Protein GI256826555 
COG category[C] Energy production and conversion 
COG ID[COG3261] Ni,Fe-hydrogenase III large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.823272 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones144 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATACAA GACCTTTAGA GAAACGGCCG GGTGCGACGC ACGCTGAAGC GGTGCGCAAT 
CGCTTCCCTG GCGTGGTGCG AGACGTGAGC TGGCAGGATG AAGATCAAAT GACGATCACT
GTCGCCATTG ATTCGCTGCC TGACGTTGTC GAATACCTTT ACTTTGGTCG TGGTGGATTT
CTGCCGATGA TGGTAGGTAA CGACGAACGT CCACTGACGG GCAATTACGC GCTGTACTAT
ATTTTATCTA TGGAAGAAGA AGATCCATGC TGGTGCACGG TGCGTGTTGA GGTGCCAGCC
GATACATGCG AGTTCCCTTC AGTAACACCG CGTGTTCCTG CTTGCGTATG GAGCGAACGC
GAAGTACGCG ATATGTACGG CCTCACTCCT GTTGGTTTAC CTGATGAACG TCGCCTGGTT
CTTCCTGATG ACTGGCCAGA CGACCTTTAT CCGCTGCGTA AGGACTCGAT GGATTACCGT
CATCGTCCTA TGCCAGCAAG CGATGTAGAA AATTATGAAT TCCTGGCTGA TACGGGCGAG
CATGAAACGA CAATCATGCC TATGGGTCCG TTGCATATTA CCTCTGACGA ACCCGGACAT
TTTCGCCTGT TCGTCGAAGG TGAAAACATT ATTGATGCAG ATTATCGCTT GTTCTATGTT
CATCGCGGCA TGGAAAAAGT TGCTGAATCG CGTATGAATT ATGATGCGGT TACCTTCCTG
GCCGACCGTG TTTGTGGCAT TTGCGGTAAC GCTCATTCGG TTGCGTACGC CGAGGCAGTC
GAGCATGCTC AAGGTATTGA AGTACCCGAG CGTGCCCAGT ATATTCGTGC TATCTCGTTG
GAAGTTGAGC GCATGCACTC TCATCTGCTC AACTTAGGCT TGGTATGTCA CTACTGCGGT
TTTGATACGG GTTTCCAGCA TTTCTTCCGC GTACGCGAAG ATTCGATGCG TCTTGCTGAA
TTGCTGACCG GTCATCGCAA GACCTATGGC ATCAATCTTA TCGGCGGTGT GCGCCGCGAT
ATCCTTTCCG AGCAGAAGCT GGCGACATTC AAAGCGGTTG ATAAACTGCG TAAAGACGTT
AAAGGCTTGG TTGACGAGCT AATGAGCACG CCAAACTTTA TTGATCGTAC AAAAGGTGTT
GGGCGACTTG ACCCGCAGAT CGCTCGTGCA TTTAGCCCAG TTGGTCCCTG CATGCGTGGT
TCAGGCTTTA CACGCGACGT GCGGTTTGAT CATCCATTCG ATGGCTATAA GTTCCTGCCG
GACACCTTCA AAGCACGTTC ACATGATGGT TGCGATGTTA TGAGCCGCTC TATGGTTCGT
ATCGAAGAGT TCCTTGATAG CTGCGATATG GTTGAATATC TGCTCGATAA TGCACCAGAA
GGTCCAATTC TGACGCAGGA TTGGACGTAC ACGCCGCATA AATATGCGTT GGGTTATACT
GAGGCGCCGC GCGGCGAGGA TACACACTGG GCTATGGTTG GCGATAACCA AAAGTGCTAT
CGCTGGCGTG CCAAGGCAGC TACGTATAGT AACTGGCCTA TTCTGCGCTA TATGTTCCGT
GGCAATACCA TTTCTGATGC AGCGCTTATC GTCGGCAGTA TGGACCCGTG CTACTCTTGT
ACTGACCGGG TAACGGTGGT AGACGTTGAG AAGAACACCA GTAAGACACT CACAAAAGAT
CAGCTGGAAT CATACTGCGT CCGCCGTACG CATTCTCCGC TGAAGGATTA G
 
Protein sequence
MDTRPLEKRP GATHAEAVRN RFPGVVRDVS WQDEDQMTIT VAIDSLPDVV EYLYFGRGGF 
LPMMVGNDER PLTGNYALYY ILSMEEEDPC WCTVRVEVPA DTCEFPSVTP RVPACVWSER
EVRDMYGLTP VGLPDERRLV LPDDWPDDLY PLRKDSMDYR HRPMPASDVE NYEFLADTGE
HETTIMPMGP LHITSDEPGH FRLFVEGENI IDADYRLFYV HRGMEKVAES RMNYDAVTFL
ADRVCGICGN AHSVAYAEAV EHAQGIEVPE RAQYIRAISL EVERMHSHLL NLGLVCHYCG
FDTGFQHFFR VREDSMRLAE LLTGHRKTYG INLIGGVRRD ILSEQKLATF KAVDKLRKDV
KGLVDELMST PNFIDRTKGV GRLDPQIARA FSPVGPCMRG SGFTRDVRFD HPFDGYKFLP
DTFKARSHDG CDVMSRSMVR IEEFLDSCDM VEYLLDNAPE GPILTQDWTY TPHKYALGYT
EAPRGEDTHW AMVGDNQKCY RWRAKAATYS NWPILRYMFR GNTISDAALI VGSMDPCYSC
TDRVTVVDVE KNTSKTLTKD QLESYCVRRT HSPLKD