Gene Bind_1164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_1164 
Symbol 
ID6199231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp1345626 
End bp1346684 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content63% 
IMG OID641705157 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_001832295 
Protein GI182678149 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGCG AGGAGCCGAT TTCCGGCATT GGTCCTGGCG GCCGGATCAC TTTGGCGCAT 
GGAGGTGGCG GCACCGCGAT GCGCGATTTG ATCGAACGCG TCTTTGTCGC CACCTTCCAC
CCCGAAGGGA CGCCACCGCT GGAAGATCAG GCGCGTTTCG ATCTCGCTGC CTTTGCCGCC
CATGGCGATC GGCTGGCTTT TACCACCGAC GGTTTCGTGG TCGAGCCGCT GGAATTTCCC
GGCGGTGATA TCGGCAAGCT TGCTGTATGC GGTACCGTGA ATGATCTTGC CGTGGGCGGC
GCGCGGCCCG TGGCCCTTTC TGCTGGTTTC ATCATTGAGG AAGGTCTGGA ACTGGAGCGT
CTGCGCCGGA TCGTGACCTC CATGGCGATG GAGGCCGCGC GCGCGCAGGT TCCGATTGTC
ACTGGTGATA CCAAAGTCGT CCCGCGCGGC GCTTGTGACG GCCTGTTCAT TACCACCACA
GGCATTGGCG TCATAAGGCC GGATTATCAG ATCAGCATTG CCGGCGCGCG GCCGGGTGAT
GTGATCCTGA TCAATGGGTC TCTGGGCGAC CATGGCGCGG CGATTCTCTG CGCGCGCGGT
GATCTCGCGC TTGACGTCAC GATTAAAAGT GATTGCGCGC CTTTGCATGA TCTCGCAGCG
GCTTTGCTCC AGGCGGTGCC GCAGGTGCGT GCCATGCGGG ATGCCACGCG TGGCGGTTTG
GCTGGGGTGC TGACGGAATT GGCCGAGGCG AGCCGTGTCG CTATCGGGGT GGATGAAGCG
GCTCTGCCGG TCAGATCCGA AGTCGCCGGC GTCTGCGAGA TTTTGGGCCT CGATCCGCTT
TATCTCGCCA ATGAAGGGAA ATTGGTGGCC GTGGTCGCGC CGGAACATGC GGAAGCGGCA
TTGGAGGCGA TGCGCGCGCA TCCCTTGGGT GTGGATGCGG CGATCATTGG AAAGGTCGCG
GCAGAAGGTC GGCCCGGCAC CGTGACATTG ATCAATCGTT TCGGTGGACG CCGCGCGGTC
ACGATGCCGT CCGGCGAACA ACTCCCGCGT ATCTGCTGA
 
Protein sequence
MSGEEPISGI GPGGRITLAH GGGGTAMRDL IERVFVATFH PEGTPPLEDQ ARFDLAAFAA 
HGDRLAFTTD GFVVEPLEFP GGDIGKLAVC GTVNDLAVGG ARPVALSAGF IIEEGLELER
LRRIVTSMAM EAARAQVPIV TGDTKVVPRG ACDGLFITTT GIGVIRPDYQ ISIAGARPGD
VILINGSLGD HGAAILCARG DLALDVTIKS DCAPLHDLAA ALLQAVPQVR AMRDATRGGL
AGVLTELAEA SRVAIGVDEA ALPVRSEVAG VCEILGLDPL YLANEGKLVA VVAPEHAEAA
LEAMRAHPLG VDAAIIGKVA AEGRPGTVTL INRFGGRRAV TMPSGEQLPR IC