Gene Lcho_2449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_2449 
Symbol 
ID6161642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp2657383 
End bp2658438 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content70% 
IMG OID641665219 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_001791479 
Protein GI171059130 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.027326 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCA AGAAGGACTA CATCCGCCCG GTCGACTTCA AACATGGCCG CGTCGACATG 
AACCACGGCG CGGGCGGGCG GGCTTCGGCG CAACTGATAG CCGAGTTGTT CGCGCGTGCC
TTCGACAACG ACTACCTGCG CCAGGGCAAC GACGGTGCGC TGCTCGACAT CCCCGCCGGC
CACCGGCTGG TGATGGCGAC CGACGCGCAC GTGATCTCGC CGCTGTTCTT TCCGGGCGGC
GACATCGGCT GCCTGTCGGT GCACGGCACG GTCAACGACG TGGCGATGCT GGGCGCGACG
CCGCTGTACC TGAGCGCGAG CTTCATCCTC GAAGAAGGCT TCGCGCTGGC CGACCTCAAG
CGCATCGTCG AGTCGATGGC CGCGGCCTCG CGTGACGCGG GCGTGCCGAT CGTCACCGGC
GACACCAAGG TGGTCGAACA GGGCAAGGGC GACGGCGTGT TCATCTCCAC CACCGGCATC
GGCGTGGTGC CGATGGACCG CCAGATCGGC GGCGCGCTGG CGCGGCCGGG CGATGTGGTG
CTGGTGTCGG GCACGATCGG CGACCACGGC GTGGCGGTGC TGTCGCAACG TGAATCGCTG
GAGTTCGAGA CCACCATCGA GTCGGACACC GCCGCGCTGC ACGGCCTGGT CGCGCGCCTG
CTGGCCGCCG TGCCTGAAGG CGCCGTGCAT TGCCTGCGCG ACCCCACGCG CGGCGGCCTA
GCGACCACGC TCAACGAGAT CACGCGCCAG TCGGGCGTGG GCATGCTGCT GCAGGAGACG
GCGATTCCCG TCGCGCCGCA GGTCAACGCC GCCTGCGAGC TGCTCGGGCT CGACCCGCTC
TACATCGCCA ACGAAGGCAA GTGCATCGTG ATCTGCGCGG CCGAACACGC CGACGCGGTG
CTCGACGCGA TGCGCGCGCA CCCGCTGGGC CGCAACGCGG CGCGCATCGG CAGCGTCACC
AACGACCCGC ACCACTTCGT GCAGATGGCC ACCGGCTTCG GCGGGCGCCG CATCGTCGAC
TGGCTCAGCG GCGAGCCGCT GCCGCGCATC TGCTGA
 
Protein sequence
MSIKKDYIRP VDFKHGRVDM NHGAGGRASA QLIAELFARA FDNDYLRQGN DGALLDIPAG 
HRLVMATDAH VISPLFFPGG DIGCLSVHGT VNDVAMLGAT PLYLSASFIL EEGFALADLK
RIVESMAAAS RDAGVPIVTG DTKVVEQGKG DGVFISTTGI GVVPMDRQIG GALARPGDVV
LVSGTIGDHG VAVLSQRESL EFETTIESDT AALHGLVARL LAAVPEGAVH CLRDPTRGGL
ATTLNEITRQ SGVGMLLQET AIPVAPQVNA ACELLGLDPL YIANEGKCIV ICAAEHADAV
LDAMRAHPLG RNAARIGSVT NDPHHFVQMA TGFGGRRIVD WLSGEPLPRI C